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(57) Abstract 

The present invention relates to the discovery in eukaryotic cells, particularly mammalian cells, of a novel family of cell-cycle 
regulatory proteins ("CCR-proteins"). As described herein, this family of proteins includes a polypeptide having an apparent molecular 
weight of 16 kDa, and a polypeptide having an ^parent molecular weight of ^proximately 15 kDa, each of which can function as an 
inhibitor of cell-cyle progression, and therefore ultimately of cell growth. Thus, simQar to the role of p21 to the p53 checkpoint, the subject 
CCR-proteins jnay function coordinately with the ceU-cycle regulatory protein, retinoblastoma (RB). Furthermore, the CCR-protein family 
includes a protein having an apparent molecular weight of 13.5 kDa (hereinafter •tl3-5"). Tbe presumptive role of pl3.5, Uke pl6 and 
pl5, is in die regulation of the ceU-cycle. 
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CELL-CYCLE REGULATORY PROTEINS, 
AND USES RELATED THERETO 



Background of the Invention 

5 

Neoplasia is characterized by deregulated cell growth and division. Inevitably, 
molecular pathways controlling cell growth must mteract with those regulating cell division. 
It was not until very recently, however, that experimental evidence became available to bring 
such connection to light. Cyclin A was found in association with the adenovirus oncoprotein 

10 ElA in virally transfonned cells (Giordona et al. Cell 58:981 (1989); and Pmes et al. Nature 
346:760 (1990)). In an early hepatocellular carcinoma, the human cyclin A gene was found 
to be the mtegration site of a fragment of the hepatitis B virus, which leads to activation of 
cyclin A transcription and a chimeric viral cyclin A protem that is not degradable in vitro 
(Wang et al. Nature 343:555 (1990)). The cell-cycle gene implicated most strongly in 

15 oncogenesis thus far is the human cyclin Dl. It was originally isolated through genetic 
complementation of yeast Gi cyclin deficient strains (Xiong et al. Cell 65:691(1991); and 
Lew et al. Cell 66:1 197 (1991)), as cellular genes whose transcription is stunulated by CSF-1 
in murine macrophages (Matsushine et al. Cell 65:701 (1991)) and in the putative oncogene 
PRADl rearranged in parathyroid tumors (Montokura et al. Nature 350:512 (1991). Two 

20 additional human D-type cyclins, cyclins D2 and D3, were subsequently identified usmg 
PGR and low-stringency hybridiazation techniques (Inaba et al. Genomics 13:565 (1992); and 
Xiong et al. Genomics 13:575 (1992)). Cyclin Dl is genetically Imked to the bcUl oncogene, 
a locus activated by translocation to an immunoglobulin gene enhancer in some B-cell 
lymphomas and leukemias, and located at a site of gene amplification in 15-20% of human 

25 breast cancers and 25-48% of squamous cell cancers of head and neck origin. 

However, the creation of a mutant onocogene is only one of the requirements needed 
for tumor formation; tumorigenesis appears to also require the additional inactivation of a 
second class of critical genes: the "anti-oncogenes" or "tumor-suppressing genes." In their 
natural state these genes act to suppress cell proliferation. Damage to such genes leads to a 

30 loss of this suppression, and thereby results in tumorigenesis. Thus, the deregulation of cell 
growth may be mediated by either the activation of oncogenes or the inactivation of tumor- 
suppressing genes (Weinberg, ILA., (Sept 1988) Scientific Amer.^^ 44-51). 

Oncogenes and tumor-suppressing genes have a basic distinguished feature. The 
oncogenes identified thus fer have arisen only in somatic cells, and thus have been incapable 

35 of transmitting their effects to the germ line of the host animal. In contrast, mutations in 
tumor-suppressmg genes can be identified in germ line cells, and are thus transmissible to an 
animal's progeny. 
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The classic example of a hereditary cancer is retinoblastomas in children. The 
incidence of the retinoblastomas is determined by a tumor suppressor gene, the 
retinoblastoma (RB) gene (Weinberg, R.A., (Sept 1988) Scientific Amer.pp 44-51; Hansen et 
al. (1988) Trends Genet 4:125-128). Individuals bom with a lesion in one of the RB alleles 
5 are predisposed to early childhood development of retinoblastomas. Inactivation or mutation 
of the second RB allele in one of the somatic cells of these susceptible individuals appears to 
be the molecular event that leads to tumor formation (Caveneee et al. (1983) Nature 305:799- 
784; Friend et al. (1987) PNAS 84:9059-9063). 

The RB tumor-suppressing gene has been localized onto human chromosome 13. The 

10 mutation may be readily transmitted through the germ line of afflicted individuals (Cavenee, 
et al. (1986) New Engl J. iWferf 314:1201-1207). Individuals who have mutations m only one 
of the two naturally present alleles of this tumor-suppressing gene are predisposed to 
retinoblastoma. Inactivation of the second of the two alleles is, however, required for 
tumorigenesis (Knudson (1971) PNAS 68:820-823). 

15 A second tumor-suppressing gene is the p53 gene (Green (1989) Cell 56:1-3; Mowat 

et al (1985 Nature 314:633-636). The protein encoded by the p53 gene is a nuclear protein 
that forms a stable complex with both the SV40 large T antigen and the adenovmis ElB 55 
kd protein. The p53 gene product may be inactivated by binding to these proteins. 

Based on cause and effect analysis of p53 mutants, the functional role of p53 as a 

20 "cell-cycle checkpoint", particularly with respect to controlling progression of a cell from G 1 
phase into S phase, has implicated pS3 as able to directly or indirectly afifect cycle cycle 
machinery. The first firm evidence for a specific biochemical link between p53 and the cell- 
cycle comes a finding that p53 apparently regulates expression of a second protem, p21, 
which inhibits cyclin-dependent kinases (CDKs) needed to drive cells through the cell-cycle, 

25 e.g. from Gl into S phase. For example, it has been demonstrated that non-viral 
transformation, such as resuhing at least in part from a mutation of deletion of of the p53 
tumor suppressor, can result in loss of p21 horn cyclin/CDK complexes. As described Xiong 
et al. (1993) Nature 366:701-704, induction of p21 in response to p53 represents a plausible 
mechanism for effecting cell-cycle arrest in response to DNA damage, and loss of p53 may 

30 deregulate growth by loss of the p2 1 cell-cycle inhibitor. 

The role of RB as a tumor-suppressor protein in cell-<^cle control is believed to be 
similiar to that of p53. However, Viliereas p53 is generally believed to be responsive to such 
indigenous environmental cues as DNA damage, the RB protein is apparently involved in 
coordinating cell growth with exogenous stimulus that normally persuade a cell to cease 

35 proliferatmg, such as diffusible growth mhibitors. In normal cells, RB is expressed 
throughout the cell cycle but exists in multiple phosphorylated forms that are specfic for 
certain phases of the cycle. The more highly phosphorylated forms are found during S and 
G2/M, whereas the underphosphorylated forms are the primary species seen in Gi and in the 
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growth arrested state. Base on these observations, it has been argued that if RB is to have a 
regulatory (suppressive) activity in the cell-cycle, this activity must be regulated at the post- 
translational level. Accordingly, underphosphorylated RB would be the form with growtii- 
suppressive acitivty, since this form is prevalent in Gl and growth arrested cells. 

To this end, it is noted that various paracrine growth inhibitors, such as members of 
the TGF-p family, prevent phosphorylation of RB and arrest cells in late Gi. Current models 
suggest that during Gl, cyclin dependent kinases and particularly cyclin D-associated 
kinases, CDK4 and CDK6, phosphotylate the product of the retinoblastoma susceptibility 
gene, RB, and thus release cells from its growth inhibitory eflects. TGF-p treatment causes 
accumulation of RB in the under-phosphorylated state and expression of RB-inactivating 
viral oncoprotems prevent TGF-p induced cell cycle arrest. In sunilar fashion, other rehited 
differentiation factors, such as activin, induce accumulation of unphosphorylated RB that is 
correlated with arrest in G] phase. 

Recentiy, it has been demonstrated that the RB protein is a phosphorylation substrate 
for both CDK4 and CDK6 (Serano et al. (1993) Nature 366:704-707; Kato et al. (1993) 
Genes Dev 7:331-342; and Meyerson et al. (1994) Mol Cell Biol 14:2077-2086). However, 
prior to the present discovery, there was little information concemmg tiie maimer by which 
CDK phosphorylation of RB was negatively regulated. 



Summary of the Invention 



The present invention relsrtes to tiie discovery in eukaryotic cells, particularly 
mammalian cells, of a novel femily of cell-cycle regulatory proteins ("CCR-protems"). As 
described herein, tiiis famUy of proteins includes a polypeptide having an apparent molecular 
weight of 16 kDa, and a polypeptide havmg an apparent molecular weight of approximately 
15 kDa, each of which can function as an inhibitor of cell-cycle progression, and tiierefore 
ultimately of ceU growtii. Thus, similar to tiie role of p21 to tiie p53 checkpoint, tiie subject 
CCR-proteins may function coordinate^ Avitii flie cell-cycle regulatory protein, 
retinoblastoma (RB). Furtiiermore, tiie CCR-protein family includes a protein having an 
apparent molecular weight of 13.5 kDa (hereinafter "pl3.5"). The presumptive role of pl3.5, 
like pi 6 and pl5, is in the regulation of tiie cell-cycle. 

One aspect of tiie invoition features a substantially pure preparation of a cell cycle 
regulatory (CCR) protein, or a fiagment tiiereof, tiie full-lengtii form of tiie CCR-protein 
having an approxunate molecular weight m tiie range of 12 to 20kD, preferably 13kD to 
18kD. In preferred embodiments, tiie full lengtii form of tiie CCR-protein has an apparent 
molecular weight of approximately 13.5kD, 15kD, IS.SkD or 16kD. In a preferred 
embodiment: tiie polypeptide has an amino acid sequence at least 60% homologous to tiie 
ammo acid sequence represented in one of SEQ ID No. 2, 4, 6 or 8; tiie polypeptide has an 
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amino acid sequence at least 80% homologous to the amino acid sequence represented in one 
of SEQ ID No. 2» 4, 6 or 8; the polypeptide has an amino acid sequence at least 90% 
homologous to the amino acid sequence represented in one of SEQ ID No. 2, 4, 6 or 8; the 
polypeptide has an amino acid sequence identical to the amino acid sequence represented in 
5 one of SEQ ID No, 2, 4, 6 or 8. In a preferred embodiment: the fragment comprises at least 5 
contiguous amino acid residues of SEQ ID No. 2, 4, 6 or 8; the fragment comprises at least 
20 contiguous amino acid residues of SEQ ID No. 2, 4, 6 or 8; the fragment comprises at 
least 50 contiguous amino acid residues of SEQ ID No. 2, 4, 6 or 8. For instance, the CCR- 
protein can comprise an amino acid sequence represented by the general formula: 

10 

Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa-Xaa-Gly- 
Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Piro-Xaa-Thr-Xaa-Xaa-Xaa-Arg-Pro-Val- 
His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-Leu-Val-Val-Leu-His- 
Xaa-Xaa~Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp-Gly-Arg-Leu-Pro- 
15 Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp-Xaa-Xaa-Xaa-Tyr-Leu- 
Arg-Xaa-Ala-Xaa-Gly 

For example, the CCR-protein can be represented by the sequence: 

Met-Asp-Pro-Ala-Ala-Gly-Ser-Ser-Met-Glu-Pro-Ser-Ala-Asp-Trp-Leu- 
20 Ala-Thr-Ala-Ala-Ala-Arg-Gly-Arg-Val-Glu-Glu-Val-Arg-Ala-Leu-Leu- 
Glu-Ala-Val -Ala-Leu- Pro~Asn-Ala~Pro-Asn-Ser-Tyr-Gly-Arg-Arg- Pro- 
Ile-Gln-Val-Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu- 
Xaa-Xaa-Gly-Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa-Xaa- 
Arg-Pro-Val-His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-Leu-Val- 
25 Val-Leu-His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp-Gly- 
Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp-Xaa-Xaa- 
Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa-Gly-Gly-Thr-Arg-Gly-Ser-Asn-His-Ala- 
Arg-Ile-Asp-Ala-Ala-Glu-Gly-Pro-Ser-Asp-Ile-Pro-Asp; 

30 or altematively, by the sequence: 

Met-Arg-Glu-Glu-Asn-Lys-Gly-Met-Pro-Ser-Gly-Gly-Gly-Ser-Asp-Glu- 
Gly-Leu-Ala-Thr-Pro-Ala-Arg-Gly-Leu-Val-Glu-Lys-Val-Arg-His-Ser- 
Trp-Glu-Ala-Gly-Ala-Asp-Pro-Asn-Gly-Val-Asn-Arg-Phe-Gly-Arg-Arg- 

35 Ala-Ile-Gln-Val-Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu- 
Leu-Xaa-Xaa-Gly-Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa- 
Xaa-Arg-Pro-Val-His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-Leu- 
Val-Val-Leu-His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp- 
Gly-Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp-Xaa- 

40 Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa-Gly- Asp , 

or yet in another embodiment, by the sequence: 

Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa-Xaa-Gly- 
45 Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa-Xaa-Arg-Pro-Val- 
His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-Leu-Val-Val-Leu-His- 



10 



20 
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Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp-Gly-Arg-Leu-Pro- 
Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp-Xaa-Xaa-Xaa-Tyr-l,eu- 
Arg-Xaa-Ala-Xaa-Gly-Cys-Ser-Leu-Cys-Ser-Ala-Gly-Trp-Ser-Leu-Cys- 
Thr-Ala-Gly-Asn-Val-Ala-Gln-Thr-Asp-Gly-His-Ser-Phe-Ser-Ser-Ser- 
Thr-Pro-Arg-Ala-Leu-Glu-Leu-Arg-Gly-Gln-Ser-Gln-Glu-Gln-Ser. 

In preferred embodiments, the CCR-protein specifically binds a CDK, e.g. a Gj phase CDK, 
e.g. CDK4 and/or CDK6. The CCR-protein can be cloned fiom a mammalian cell, e.g. a 
hxmian cell, e.g. a mouse cell. 

Another aspect of the present invention features a polypeptide, of the CCR-protein 
family, which functions m one of either role of an agonist of cell-cycle regulation or an 
antagonist of cell-cycle regulation. In a preferred embodiment: the subject CCR-protein 
specifically binds a cyclin dependent kinase (CDK), e.g. specifically binds CDK4; e.g. 
specifically binds CDK6; e.g. inhibits a kinase activity of CDK4; inhibits a kinase activity of 
15 CDK6; e.g. inhibits phosphorylation of an FJB protein by CDK4. In a more preferred 
embodiment: the CCR-protein regulates a eukaryotic cell-cycle, e.g. a mammalian cell-cycle, 
e.g., a human cell-cycle; the CCR-protein inhibits proliferation/cell growth of a eukaryotic 
cell, e.g., a human cell; the CCR-protein inhibits progression of a eukaryotic cell fi-om Gl 
phase into S phase, e.g., inhibits progression of a mammaUan cell from Gl phase into S 
phase, e.g., inhibits progression of a human cell fi«m Gl phase into S phase; the CCR- 
protein inhibits the kinase activity of a cyclin dependent kinase (CDK), e.g. a CDK active in 
Gl phase, e.g. CDK 4; the CCR-protein suppresses tumor growth, e.g. in a tumor cell, e.g. in 
a tumor ceU having an unimpaired RB or RB-Iike protein checkpoint Moreover, CCR- 
proteins of the present invention may also have biological activities which include: an abiUty 
25 to regulate ceU-cycle progression in response to extracellular factors and cytokines, e.g. 
fimctional in paracrine or autocrine regulatic of cell growth and/or differentiation, e.g. 
inhibit CDK activation in response to Uransfi-rming growth &ctor-P (TGF-P) or related 
growth, differentiation or morphogenesis factoi. 

Still anotiier aspect of the invention concerns a polypeptide having an ammo acid 
30 sequence homologous to an amino acid sequence rq>resented by one of SEQ ID Nos. 

Yet another aspect of the present invention concerns an immunogen comprising a 
CCR-protein of the present invention, or a fragment thereof; in an immunogenic preparation, 
the immunogen being capable of eKciting an immune response specific for the CCR-protein; 
e.g. a humoral refuse, e.g. an antibody reqionse; e.g. a cellular response. 
35 Another aspect of the presort mvention features recombinant CCR-protein, or a 

fragment thereof, having an amino acid sequence preferably: at least 60% homologous to the 
amino acid sequence represented in one of SEQ ID No. 2, 4, 6 or 8; at least 80% homologous 
to the amino acid sequence represented in one of SEQ ID No. 2, 4, 6 or 8; at least 90% 
homologous to Hbe amino acid sequence represented in one of SEQ ID No. 2, 4, 6 or 8; 
40 identical to Hoc amino acid sequence represented in one of SEQ ID No. 2, 4, 6 or 8. In a 
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preferred embodiment: the fragment comprises at least S contiguous amino acid residues of 
SEQ ID No. 2, 4, 6 or 8; the fragment comprises at least 20 contiguous amino acid residues 
of SEQ ID No. 2, 4, 6 or 8; the fragment comprises at least 50 contiguous amino acid 
residues of SEQ ID No. 2, 4, 6 or 8. In a preferred embodiment, the recombinant CCR- 
5 protein functions in one of either role of an agonist of cell-cycle regulation or an antagonist 
of cell-cycle regulation. In a more preferred embodiment: the CCR-protein specifically binds 
a cyclin dependent kmase (CDK), e.g. specifically binds CDK4; e.g. specifically bmds 
CDK6; e.g. inhibits a kinase activity of CDK4; inhibits a kinase activity of CDK6; e.g. 
inhibits phosphorylation of an RB protein by CDK4. In a more preferred embodiment: the 

10 CCR-protein regulates a eukaryotic cell-cycle, e.g. a mammalian cell-cycle, e.g., a human 
cell-cycle; the CCR-protein inhibits prolifemtion/cell growth of a eukaryotic cell, e.g., a 
human cell; the CCR-protein mhibits progression of a eukaryotic cell from Gl phase into S 
phase, e.g., mhibits progression of a mammalian cell from Gl phase into S phase, e.g., 
inhibits progression of a human cell from Gl phase into S phase; the CCR-protein inhibits 

15 the kinase activity of a cyclin dependent kinase (CDK), e.g. a CDK active in Gl phase, e.g. 
CDK 4; the CCR-protein suppresses tumor growth, e.g. in a tumor cell, e.g. in a tumor cell 
having an unimpaired RB or RB-like protem checkpoint. 

In yet other preferred embodiments, the recombinant CCR-protein is a fusion protein 
further comprising a second polypeptide portion having an amino acid sequence from a 

20 protein unrelated the protein of SEQ ID No. 2, 4, 6 or 8. Such fusion proteins can be 
functional in a two-hybrid assay. 

Another aspect of the present invention provides a substantially pure nucleic acid 
having a nucleotide sequence which encodes a CCR-protein, or a fragment thereof, having an 
amino acid sequence at least 60% homologous to one of SEQ ID Nos. 2, 4, 6 or 8. In a more 

25 preferred embodiment: the nucleic acid encodes a protein having an amino acid sequence at 
least 80% homologous to SEQ ID No. 2, more preferably at least 90% homologous to SEQ 
ID No. 2, and most preferably at least 95% homologous to SEQ ID No. 2; the nucleic acid 
encodes a protein having an amino acid sequence at least 80% homologous to SEQ ID No. 6, 
more preferably at least 90% homologous to SEQ ID No. 6, and most preferably at least 95% 

30 homologous to SEQ ID No. 6. The nucleic preferably encodes a CCR-protein which 
specifically binds a cyclin dependent kinase (CDK); e.g. specifically binds CDK4; e.g. 
specifically binds CDK6; e.g. which inhibits a kinase activity of CDK4; e.g. which inhibits 
phosphoiylation of an RB protein by CDK4. 

In another embodiment, the nucleic acid hybridizes under stringent conditions to a 

35 nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No. 1; 
more preferably to at least 20 consecutive nucleotides of SEQ ID No. 1 ; more preferably to at 
least 40 consecutive nucleotides of SEQ ID No. 1. 
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In a further embodiment, the nucleic acid hybridizes under stringent conditions to a 
nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No. 3; 
more preferably to at least 20 consecutive nucleotides of SEQ ID No. 3; more preferably to at 
least 40 consecutive nucleotides of SEQ ID No. 3. 

In yet a fiirther embodiment, the nucleic acid hybridizes under stringent conditions to 
a nucleic acid probe corresponding to at least 12 consecutive nucleotides of SEQ ID No. 5; 
more preferably to at least 20 consecutive nucleotides of SEQ ID No. 5; more preferably to at 
least 40 consecutive nucleotides of SEQ ID No. 5. 

In yet a furthw embodiment, the nucleic acid hybridizes under stringent conditions to 
a nucleic acid probe conespondmg to at least 12 consecutive nucleotides of SEQ ID No. 7; 
more preferably to at least 20 consecutive nucleotides of SEQ ID No. 7; more preferably to at 
least 40 consecutive nucleotides of SEQ ID No. 7. 

Furthermore, in certain embodiments, the CCR nucleic acid will comprise a 
ti-anscriptional regulatory sequence, e.g. at least one of a transcriptional promoter or 
15 ti-anscriptional enhancer sequence, operably linked to the CCR-gene sequence so as to render 
tiie recombinant CCR-gene sequence suitable for use as an expression vector. 

The present invention also featiores ttansgenic non-human animals, e.g. mice, which 
either express a heterologous CCR-gene, e.g. derived from humans, or which mis-express 
their own CCR-gene, e.g. pl6, pl5 or pl3.5 expression is disrupted. Such a transgenic 
20 animal can serve as an animal model for stiidying cellular disorders comprising mutated or 
mis-expressed CCR alleles. 

The present invention also provides a probe/primer comprising a substantially 
purified oligonucleotide, wherein the oUgonucleotide comprises a region of nucleotide 
sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides 
25 of sense or antisense sequence of one of SEQ ID No. 1, 3, 5 or 7, or naturaUy occurring 
mutants thereof In preferred embodiments, tiie probe/primer further comprises a label gfoap 
attached tiiereto and able to be detected, e.g. tiie label group is selected from a group 
consisting of radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. Such 
probes can be used as a part of a diagnostic test kit for identifying transformed cells, such as 
for measuring a level of a pl6, pl5 or pl3.5 encoding nucleic acid in a sample of cells 
isolated from a patient; e.g. for measuring the mRNA level in a cell or determining whetiier 
the genomic CCR gene has been mutated or deleted. 

The present invention also provide a metlK>d for treating an animal having unwanted 
cell growtii characterized by a loss of wild-type CCR-protein function, comprising 
35 administering a tiierapeutically effective amount of an agent able to inhibit a kinase activity 
of a CDK, e.g. CDK4. In one embodunent, tiie metfiod comprises administering a nucleic 
acid construct encoding a CCR protein, e.g. pl6, pl5 or pl3.5, e.g. a polypeptide represented 
in one of SEQ ID Nos. 2, 4, 6 or 8, under conditions wherein tiie construct is incorporated by 
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CCR-deficient cells and the polypeptide is expressed, e.g. by gene therapy techniques. In 
another embodiment, the method comprises administering a CCR mimetic, e.g. a 
peptidomimetic, which binds to and inhibits the CDK. 

Another aspect of the present invention provides a method of determining if a subject, 
e.g. a human patient, is at risk for a disorder characterized by unwanted cell proliferation, 
comprising detecting, in a tissue of the subject, the presence or absence of a genetic lesion 
characterized by at least one of (i) a mutation of a gene encoding a protein represented by 
one of SEQ ID Nos. 2, 4, 6 or 8, or a homolog thereof; or (ii) the mis-expression of the 
CCR-gene, e.g. the pl6, pl5 or pl3.5 gene. In preferred embodiments: detecting the genetic 
lesion comprises ascertaining the existence of at least one of a deletion of one or more 
nucleotides firom said gene, an addition of one or more nucleotides to said gene, an 
substitution of one or more nucleotides of said gene, a gross chromosomal rearrangement of 
said gene, a gross alteration in the level of a messenger RNA transcript of said gene, the 
presence of a non-wild type splicing pattem of a messenger RNA transcript of said gene, or a 
non-wild type level of said protein. For example, detecting the genetic lesion can comprise 
(i) providmg a probe/primer comprising an oligonucleotide containing a region of nucleotide 
sequence which hybridizes to a sense or antisense sequence of one of SEQ ID Nos. 1, 3, 5 or 
7, or naturally occurring mutants thereof, or 5' or 3^ flankmg sequences naturally associated 
with the CCR-gene; (ii) exposing the probe/primer to nucleic acid of the tissue; and (iii) 
detecting, by hybridization of the probe/primer to the nucleic acid, the presence or absence of 
the genetic lesion; e.g. wherein detecting the lesion comprises utilizing the probe/primer to 
determine the nucleotide sequence of the CCR-gene and, optionally, of the flanking nucleic 
acid sequences; e.g. wherein detecting the lesion comprises utilizing the probeyi>rimer in a 
polymerase chain reaction (PCR); e.g. wherein detecting the lesion comprises utilizing the 
probe/primer in a ligation cham reaction (LCR). In alternate embodhnents, the level of said 
protein is detected in an immunoassay. 

Yet another aspect of the invention pertains to a peptidomimetic which binds to a 
CCR-protem, e.g. pl5 or pi 6, and inhibits its bmding to a CDK, e.g. CDK4 or CDK6. For 
example, a preferred peptidomimetic is an analog of a peptide having the sequence 
VAEIG(V/E)GAYG(T/K).V(F/Y)KARD, though more preferably the peptidomimetic is an 
analog of the hexa-peptide V(F/Y)KARD, and even more preferably of the tetrapeptide 
KARD. Likewise, another preferred peptide analog of CDK4/CDK6 comprises a peptide or 
peptidomimetic corresponding to the sequence SRTDRE(l'l)K(V/L)TLWEHVIX3DL(Ra^ 
TYLDK(AAOPPPG(LAO, though more preferably the peptidomimetic is an analog of the 
hexa-peptides FEHVDQ or EHVDQD, and even more preferably of the tetrapeptides HVDQ 
or EHVD. Non-hydrolyzable peptide analogs of such residues can be generated using, for 
example, benzodiazepine, azepine, substituted gama lactam rings, keto-methylene 
pseudopeptides, p-tum dipeptide cores, or P-aminoalcohols. 
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Other features and advantages of the invention will be apparent from the following 
detailed description, and from the claims. The practice of the present invention will 
employ, unless otherwise indicated, conventional techniques of cell biology, cell culture, 
molecular biology, transgenic biology, microbiology, recombinant DNA, and immunology, 
5 which are within the skill of the art. Such techniques are explained fully in the literature. 
See, for example, Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by Sambrook, 
Fritsch and Maniatis (Cold Spring Harbor Laboratory Press:1989); DNA Cloning, Volumes I 
and II (D. N. Glover ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed., 1984); MuUis et al. 
U.S. Patent No. 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 

10 1984); Transcription And Translation (B. D. Hames & S. J. Higgins eds. 1984); Culture Of 
Animal Cells (R. 1. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984); the treatise. 
Methods In Enzymology (Academic Press, Inc., N.Y.); Gene Transfer Vectors For 
Mammalian Cells (J. H. Miller and M. P. C:alos eds., 1987, Cold Spring Harbor Laboratory); 

15 Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.). Immunochemical Methods In 
Cell And Molecular Biology (Mayer and Walker, eds.. Academic Press, London, 1987); 
Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weu- and C. C. Blackwell, 
eds., 1986); Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratoiy Press, Cold 
Spring Harbor, N.Y., 1986). 

20 

DescripUon of the Drawing 

Figures lA-lB are a schematic representation of the pi 6 cDNA, indicating the 
location of exon boundaries and PCR primers used in the present invention. 
25 Figures 2A and 2B are the genomic nucleic acid sequence for exon 1 and the non- 

coding sequences directly flankmg exon 1. The sequence is a composite sequence from 
several primers. Figure 2C is the genomic sequence about exon 2. 

Figures 3 A-3D are the genomic nucleic acid sequence for exon 3 and the non-coding 
sequences dirwtly flanking exon 3. The sequence is a composite sequence from several 
30 primers. 

Figure 4 illustrates the loss of pl6 sequences from the genomes of several human 
tun^or cells, as compared to normal human controls and other human tumors. 
Figure 5 illustrates the restriction map for the mouse pi 6 gene. 
Figure 6 is a sequence alignment of a highly conserved portion of the CCR-proteins 
35 pl6,pl5andpl3.5. 

Figure 7 is a sequence alignment of portions of cyclin-dependent kinases. 
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Detailed Description of the Invention 



Progression through the cell-cycle is marked by a series of irreversible transitions that 
separate discrete tasks necessary for faithful cell duplication. These transitions are negatively 
5 regulated by signals that constrain the cell-cycle until specific conditions are fulfilled. Entry 
in to mitosis, for example, is inhibited by incompletely replicated DNA or DNA damage. 
These restriction on cell-cycle progression are essential for preserving the fidelity of the 
genetic information during cell division. The transition from Gj to S phase, on the other 
hand, coordinates cell proliferation with environmental cues, after which the checks on the 
10 cell-cycle progression tend to be cell autonomous. Among the signals that restrict cell-cycle 
progression during G] are extracellular proteins which inhibit cell proliferation, growth factor 
or amino acid depletion, and cell-cell contact. Disruption of these signaling pathways 
uncouples cellular responses from environmental controls and may lead to unrestrained cell 
proliferation. 

15 Eukaryotic cells, in general, require cyclin-dependent kinases (CDKs) for progression 

through G] and entry into S phase. For instance, in mammalian cells, both D- and E-type 
cyclins are rate limiting for the Gj to S transition, and both reduce, but do not eliminate, the 
cell's requirement for mitogenic growth factors. However, prior to the present discovery, 
there was little information concerning the manner by which these cyclins and CDKs are 

20 negatively regulated by either intracellular or extracellular signals that inhibit cell 
proliferation* 

The present invention is du:ected to the discovery of a family of related cell-cycle 
regulatory proteins (herein refered to as "CCR-proteins") which function typically to restrict 
progression of a cell through mitosis, and are likely to be involved in controlling progression 

25 through meiosis. Members of this family are evolutionarily related to the approximately 
16kd protein represented in SEQ ID No. 2, designated "pl6" (also "pl6lNK4a»») -jhis family 
includes, for example, a polypeptide (termed "pl5") having an approximate molecular weight 
of 15Kd, and a 13.5 Kd polypeptide (termed "pi 3.5"). The nucleotide sequences for the 
human pl6, the human pl5, the mouse pl3.5, and the mouse pl5 coding sequences are 

30 provided in SEQ ID Nos. 1, 3, 5 and 7, respectively. The corresponding amino acid 
sequences are represented in SEQ ID Nos. 2, 4, 6 and 8. Moreover, data from hybridization 
and immunoprecipitation experiments indicates still other members of the CCR-protein 
&nuly exist, comprising proteins representing both evolutionarily divergent sequences as 
well as differentially spliced variants. 

35 One function of members of this family of proteins in cell-cycle regulation is in 

modulating the activity of cyclin/CDK complexes during various stages of the cell-cycle, 
particularly those which include CDKs active in Gj phase, such as CDK4 or CDK6. To 
illustrate, both pl6 and pi 5 are demonstrated below to exert an inhibitory effect on the 
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activity of cyclin/CDK complexes, particularly those which include CDK4 or CDK6. For 
instance, each protein is able to inhibit the activity of cydin DUCDK complexes in wVo. As 
is generally known, cyclin Dl has been associated with a wide variety of proliferative 
diseases. Consequently, the present invention identifies a potential inhibitor of cell 
proliferation resulting from oncogenic expression of cyclin Dl. Moreover, the diversity of 
members of the CCR-protein family, lilce the diversity of CDKs, is suggestive of 
individualistic roles of each member of this family, which may be tissue-type of cell-type 
specific, occur at different points in the cell-cycle, occur as part of different extracellular or 
intracellular signaling pathways, or a combination thereof. 

As described in the examples below, certain of the CCR-proteins have been shown to 
be deleted or mutated at high fi«quency in tumors, such as derived from lung, breast, brain, 
bone, skin, bladder, kidney, ovaiy, or lymphocytes. Consequently, as set forth in the present 
plication, replacement of CCR protein function by gene therapy or by CCR mimetics, or by 
direct inhibition of CDK4 or CDK6 activity, is therefore a potential therapy for treating such 
15 proliferative disorders. Moreover, the present data demonstrates fliat pl5 expression is 
regulated by treatment with transforming growth factor-beta (TGF-p), suggesting that pl5 
may function as an effector of TGF-P mediated cell cycle arrest via inhibition of CDK4 or 
CDK6 kinases. Considered in light of recent findings demonstrating tiiat reduced 
responsiveness to TGF-p may be an important event in tiie loss of growth conti-ol in many 
20 proliferative disorders, an approach to modulate CDK4/6 activity by CCR mimetics or CCR- 
gene therapy, or by medianism based inhibitors of the kinases tiiemselves, is even more 
attractive for treating such proliferative disordeis. 

Accordingly, the present invention makes available diagnostic and theisqjeutic assays 
and reagents for detecting and ti«ating proliferative disorders arising from, for example. 
25 tiunorigenic transformation of ceUs, or otiier hyperplastic or neoplastic transformation 
processes, as well as dififerentiative disorders, such as degeneration of tissue, e.g. 
neurodegeneration. For example, tiie present invention makes available reagents, such as 
antibodies and nucleic acid probes, for detecting altered complex formation, and/or altered 
levels of CCR-protem expression, and/or CCR-gene deletion or mutation, in order to identify 
30 transformed cells. Moreover, tiie present invention provides a metiiod of treating a wide 
variety of pathological ceU proliferative conditions, such as by gene tiierapy utilizmg 
recombinant gene constructs encoding tiie subject CCR-protems, or by providing CCR- 
mimetics, witii the general strategy being tiie inhibition of aberranfly proliferatiiig ceUs. 

The subject proteins can also be used in assay systems to identify agents which eitiier 
35 decrease tiie abiUty of tiie CCR-protein to bind a CDK (e.g. CDK4 or CDK6) and tiiereby 
relieve mhibition of cyclin/CDK complexes, or alternatively, which agonize or mimic flie 
CCR-mediated inhibition of CDK activation. In tiie latter, e.g. CCR mimetics. tiie 
consequence of inhibiting activation of a cyclin/CDK complex, e.g. cyclin D/CDK4, is tiie 
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failure of the cell to advance through the cell-cycle, which inhibition can lead ultimately to 
cell death. Reactivation of the CDK/cyclin complexes, on the other hand, can disrupt or 
otherwise imbalance the cellular events occurring in a transformed cell. Such agents can be 
of use therapeutically to activate CDK complexes in cells transformed, for example, by tumor 
5 viruses. Treatment of such cells can cause premature progression through a checkpoint, e.g. 
the retinoblastoma (RB) checkpoint, and result in mitotic catastrophe (cell death) or induction 
of apoptosis. 

As used herein, the term "nucleic acid" refers to polynucleotides such as 

deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The terra 
10 should also be understood to include, as equivalents, analogs of either RNA or DNA made 

firom nucleotide analogs, and, as applicable to the embodiment being described, single (sense 

or antisense) and double-stranded polynucleotides. 

As used herein, the terms "gene", "recombinant gene" and "gene construct" refer to a 

nucleic acid comprising an open reading frame encoding a cell-cycle regulatoiy of the present 
15 invention, including both exon and (optionally) intron sequences. In preferred embodiments, 

the nucleic acid is DNA or RNA. Exemplary recombinant genes include nucleic acids which 

encode all or a CDK-binding portion of the pl6 protein represented in SEQ ID No. 2, the pi 5 

proteins represented in SEQ ID Nos. 4 and 8, or the pl3.5 protein represented in SEQ ID No. 

6. The term "intron" refers to a DNA sequence present in a given CCR-gene which is not 
20 translated into protein and is generally fotmd between exons. 

"Homology" refers to sequence similarity between two peptides or between two 

nucleic acid molecules. Homology can be determined by comparing a position in each 

sequence vAdch may be aligned for purposes of comparison. When a position in the 

compared sequence is occupied by the same base or amino acid, then the molecules are 
25 homologous at that position. A degree of homology between sequences is a function of the 

number of matching or homologous positions shared by the sequences. 

The term "transfection" refers to the introduction of a nucleic acid, e.g., an 

expression vector, into a recipient cell by nucleic acid-mediated gene transfer. 

"Transformation", as used herein, refers to a process in which a cell's genotype is changed as 
30 a result of the cellular ixptske of exogenous DNA or RNA, and, for example, the transformed 

cell expresses a recombinant form of one of the subject cell-cycle regulatory proteins, e.g. 

'pld.plSorplS.S. 

"Cells" or "cell cultures" or "lecombmant host cells" or "host cells" are often used 
interchangeably as will be clear firom the context. These terms include the immediate subject 
35 cell which expresses the cell-cycle regulatory protein of tiie present invention, and, of course, 
the progeny thereof It is understood that not all progeny are exactly identical to the parental 
cell, due to chance mutations or difference in environment. However, such altered progeny 
are included in these terms, so long as the progeny retain the characteristics relevant to those 
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conferred on the originally transfonned cell. In the present case, such a characteristic might 
be the ability to produce a recombinant CCR-protein, 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to vMch it has been linked. The term "expression vector- 
includes plasmids, cosmids or phages capable of synthesizing the subject CCR-protein 
encoded by the respective recombinant gene carried by the vector. Preferred vectors are 
those cquble of autonomous replication and/expression of nucleic acids to which they are 
linked. In the present specification, "plasmid" and "vector" are used interchangeably as the 
plasmid is the most commonly used form of vector. Moreover, the invention is intended to 
include such other forms of expression vectors which serve equivalent functions and which 
become known in the art subsequently hereto. 

"Transcriptional regulatory sequence" is a generic term used throughout the 
specification to refer to DNA sequences, such as initiation signals, enhancers, and promoters, 
as well as polyadenylation sites, which induce or control transcription of protein coding 
1 5 sequences with which they are operably linked. In preferred embodiments, transcription of a 
recombinant CCR-gene is under the control of a promoter sequence (or other transcriptional 
regulatory sequence) which controls the expression of the recombinant gene in a cell-type in 
which expression is intended. It will also be understood that the recombinant gene can be 
under the control of transcriptional regulatory sequences which are the same or which are 
20 different from those sequences which control transcription of the naturally-occurring form of 
flie regulatory protein. 

The term "tissue-specific promoter" means a DNA sequence that serves as a 
promoter, i.e., regulates expression of a selected DNA sequence opeiably linked to the 
promoter, and which effects expression of the selected DNA sequence in specific cells of a 
25 tissue, such as cells of a neuronal lineage, e.g. glial cells, or alternatively, in epithelial cells, 
e.g. melanocytes. In an illustrative embodiment, gene constructs utilizing glial-specific 
promoters can be used as a part of gene therapy to cause expression of recombinant forms 
of one of the subject CCR-proteins in glioma ceUs with a feature of the gene construct 
beiAg a tissue-specific promoter for directing expression of the subject protein in only glial 
30 tissue. The term also covers so-called "lealqr" promoters, which regulate expression of a 
selected DNA primarily in one tissue, but cause eiqnression in other tissues as well. 

As used herein, a "transgenic animal" is any animal, preferably a non-human mammal 
in which one or more of the cells of the animal contam heterologous nucleic acid introduced 
by way of human intervention, such as by trangenic techniques well known in the art. The 
35 nucleic acid is introduced into the ceU, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by 
infection with a recombinant virus. The term genetic manipulation does not include classical 
cross-breeding, or in vitro fertilization, but rather is directed to tiie introduction of a 
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recombinant DNA molecule. This molecule may be integrated within a chromosome, or it 
may be extrachromosomally replicating DNA. In the typical transgenic animals described 
herein, the transgene causes cells to express a recombinant form of the subject CCR 
proteins, e.g. either agonistic or antagonistic forms, or in which an endogenous CCR-gene 
5 has been disrupted. However, transgenic animals in which the recombinant CCR-gene is 
silent are also contemplated, as for example, the FLP or CRE recombinase dependent 
constructs described below. The "non-human animals" of the invention include vertebrates 
such as rodents, non-himian primates, sheep, dog, cow, amphibians, reptiles, etc. Preferred 
non-human animals are selected from the rodent family including rat and mouse, most 

10 preferably mouse. The temi "chimeric animal" is used herein to refer to animals in which 
the recombinant gene is found, or in which the recombinant is expressed in some but not all 
cells of the animal. The term "tissue-specific chimeric animal" indicates that the recombinant 
CCR-gene is present and/or expressed, or disrupted, in some tissues but not others. 

As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g., a 

15 CCR-polypeptide), which is partly or entirely heterologous, i.e., foreign, to the transgenic 
animal or cell into which it is introduced, or, is homologous to an endogenous gene of the 
transgenic animal or cell into which it is introduced, but which is designed to be inserted, or 
is inserted, into the animal's genome in such a way as to alter the genome of the cell into 
which it is inserted (e.g., it is inserted at a location which differs from that of the natural gene 

20 or its insertion results in a knockout). A transgene can include one or more transcriptional 
regulatory sequences and any other nucleic acid, such as mtrons, that may be necessary for 
optimal expression of a selected nucleic acid. 

As used herein, the terms "transforming growth factor-beta" and "TGF-P" denote a 
&mily of structurally related paracrine polypeptides found ubiquitously in vertebrates, and 

25 prototypic of a large family of metazoan grcwtl, differentiation, and morphogenesis factors 
(see, for review, Massaque et al. (1990) Ann Rev Cell Biol 6:597-641; and Spom et al. (1992) 
JCe//fi/a/ 119:1017-1021). 

The term "evolutionarily related to", with respect to nucleic acid sequences encoding 
CCR-proteins, refers to nucleic acid sequences which have arisen naturally in an organism^ 

30 including naturally occurring mutants. In general, genes which encode proteins that are 
evolutionarily related to the pi 6 protem can be recognized by the general features of anl^rin- 
Uke repeats and the ability to bind to a cyclin dependent kinase. Moreover, the term 
"evolutionarily related to" also refers to nucleic acid sequences which, while derived from a 
naturally occurring CCR-proteins, have been altered by mutagenesis, as for example, the 

35 combinatorial mutagenesis techniques described below, yet still encode polypeptides which 
have at least one activity of a CCR-protein. For instance, the sequence of pl6 can be altered 
by mutagenesis based on amino acid substitutions derived from alignment with the pi 5 
and/or pi 3.5 sequences. 
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One a^ect of the present invention pertains to an iisolated nucleic acid comprising the 
nucleotide sequence encoding a CCR-protein, fiagments thereof encoding polypeptides 
having at least one biological activity of a CCR-protein, and/or equivalents of such nucleic 
adds. The term nucleic acid as used herein is intended to include such firagments and 
5 equivalents. The term equivalent is understood to include nucleotide sequences encoding 
functionally equivalent CCR-proteins or functionally equivalent peptides having an activity 
of a CCR-protein such as described herein. Equivalent nucleotide sequences will include 
sequences that differ by one or more nucleotide substitutions, additions or deletions, such as 
allelic variants; and will also include sequences that differ from the nucleotide sequence 

10 encoding the subject pl6 protem represented by SEQ ID No. 2, or the pl5 proteins 
represented by SEQ ID Nos. 4 and 8, or the plT^.S protein represented by SEQ ID No. 6 due 
to the degeneracy of the genetic code. EquIvaJents will also include nucleotide sequences 
that hybridize under stringent conditions (i.e., equivalent to about 20-27**C below the melting 
temperature (T^) of the DNA duplex formed in atK)ut IM salt) to the nucleotide sequence of 

15 a CCR-gene shown in SEQ ID No. 1, 3, 5 or 7. In one embodiment, equivalents will further 
include nucleic acid sequences derived from and evolutionarily related to the nucleotide 
sequence shown in one of SEQ ID No. 1 , 3, 5 or 7, 

The term "isolated" as also used herein with respect to nucleic acids, such as DNA or 
RNA, refers to molecules separated from other DNAs, or RNAs, respectively, that are present 

20 in the natural source of the macromolecule. For example, an isolated nucleic acid encoding 
on of the subject CCR-proteins preferably includes no more than 10 kilobases (kb) of nucleic 
acid sequence which naturally immediately flanks the CCR-gene in genomic DNA, more 
preferably no more than Skb of such naturally occurring flanking sequences, and most 
preferably less than l.Skb of such naturally occurring flanking sequence. The term isolated 

25 as used herein also refers to a nucleic acid or pepdde that is substantially free of cellular 
material or culture medium when produced by recombinant DNA techniques, or chemical 
precursors or other chemicals when chemically synthesized. Moreover, an "isolated nucleic 
acid" is meant to include nucleic acid fragments which are not naturally occurring as 
fragments and would not be found in the natural state. 

30 Polypeptides referred to herein as having an activity of a cell-cycle regulatory protein 

preferably have an amino acid sequence homologous to and corresponding to all or a portion 
of the amino acid sequence of the pi 6 protein shown in SEQ ID No. 2. Exemplary 
polypeptides include the pl5 protein shown in either SEQ ID No. 4 or 8, or of the pl3.5 
protein shown in SEQ ID No. 6, or isofonns of one of these proteins (including differential 

35 splicing variants). In preferred embodiments, the biological activity of a CCR-protein 
includes one or more of the following: (i) an ability to regulate a eukaryotic cell-cycle, e.g. a 
manunalian cell-cycle, e.g., a human cell-cycle; (ii) an ability to inhibit proliferation/cell 
growth of a eukaryotic cell, e.g. a mammalian cell, e.g., a human cell; (iii) an ability to 
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inhibit progression of a eukaiyotic cell £rom Gl phase into S phase, e.g., inhibit progression 
of a mammalian cell from Gl phase into S phase, e.g., inhibit progression of a human cell 
fix)m Gl phase into S phase; and/or (iv) an ability to mhibit the kinase activity of a cyclin 
dependent kinase (CDK), e.g. a CDK active in Gl phase, e.g. CDK 4, e.g. CDK6; e.g. an 
5 ability to inhibit phosphorylation of a retinoblastoma (RB) or retinoblastoma-like protein by 
a cyclin dependent kinase. Moreover, CCR-proteins of the present invention may also have 
biological activities which include: an ability to suppress tumor growth, e.g. in a tumor 
having an unimpaired RB protein; an ability to regulate cell-cycle progression in response to 
extracellular factors and cytokines, e.g. functional in paracrine or autocrine regulation of cell 

10 growth and/or differentiation, e.g. inhibit CDK activation in response to transforming growth 
&ctor-P (TGF-p) or related growth, differentiation or morphogenesis factor. In this respect, 
the CCR-proteins of the present invention may also fimction to prevent de-differentiation of 
cells/tissue. Other biological activities of the subject CCR-proteins are described herein or 
will be reasonably apparent to those skilled in the art in light of the present disclosure. 

15 Moreover, it will be generally appreciated that, under certain circumstances, it will be 

advantageous to provide homologs of naturally-occurring forms of particular CCR-proteins 
which are either agonists or antagonists of only a subset of that protein's biological activities. 
Thus, specific biological effects can be elicited by treatment with a homolog of limited 
function, and with fewer side effects relative to treatment with agonists or antagonists which 

20 are directed to all of the biological activities of that protein. For example, pl6 homologs can 
be generated vMch bind to and inhibit activation of CDK4 without substantially interfering 
with the activation of CDK6. 

In one embodiment, the nucleic acid of the invention encodes a peptide which is an 
agonist or antagonist of the pl6 protein and comprises an amino acid sequence shown in SEQ 

25 ID No. 2. Preferred nucleic acids encode a peptide having a pi 6 protein activity and being at 
least 60% homologous, more preferably 70% homologous and most preferably 80% 
homologous with an amino acid sequence shown in SEQ ID No. 2. Nucleic acids which 
encode peptides having an activity of a pi 6 protein and having at least about 90%, more 
preferably at least about 95%, and most preferably at least about 98-99% homology with a 

30 sequence shown in SEQ ID No. 2 are also within the scope of the invention. 

In another embodiment, the nucleic acid of the invention encodes a peptide which is 
an agonist or antagonist of the pi 5 protein and comprises an amino acid sequence shown in 
SEQ ID No. 4. Preferred nucleic acids encode a peptide having a pl5 protein activity and 
being at least 60% homologous, more preferably 70% homologous and most preferably 80% 

35 homologous with an amino acid sequence shown in one or both of SEQ ID Nos. 4 and 8. 
Nucleic acids which encode peptides having an activity of a pi 5 protein and having at least 
about 90%, more preferably at least about 95%, and most preferably at least about 98-99% 
homology with a sequence shown in SEQ ID No. 4 or 8 are also within the scope of the 
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invention. In a representative embodiment, the nucleic acid is a cDNA molecule comprising 
at least a portion of the nucleotide sequence encoding a pi 5 protein shown in SEQ ID No. 3 
or 7. A preferred portion of the cDNA molecule shown in SEQ ID No. 3 or 7 includes the 
coding region of the molecule. 
5 In yet another embodiment, the nucleic acid of the invention encodes a peptide havmg 

an activity of a pl3.5 protein and comprising an amino acid sequence shown in SEQ ID No. 
6. Preferred nucleic acids encode a peptide having a pi 3.5 protein activity and being at least 
60% homologous, more preferably 70% homologous and most preferably 80% homologous 
with an amino acid sequence shown in SEQ ID No. 6. Nucleic acids which encode peptides 

10 having an activity of a pl3.5 protein, such as the ability to bind a CDK, and having at least 
about 90%, more preferably at least about 95%, and most preferably at least about 98-99% 
homology with a sequence shown in SEQ ID No. 6 are also withm the scope of the invention. 
Preferably, the nucleic acid is a cDNA molecule comprising at least a portion of the 
nucleotide sequence encoding a pi 3.5 protein shown in SEQ ID No. 5. A preferred portion 

15 of the cDNA molecule shown in SEQ ID No. 5 includes the coding region of the molecule. 

Another aspect of the invention provides a nucleic acid which hybridizes under high 
or low stringency conditions to a nucleic acid which encodes a CCR polypeptide havmg all 
or a portion of an amino acid sequence shown in one of SEQ ID Nos. 2, 4, 6 or 8. 
Appropriate stringency conditions which promote DNA hybridization, for example, 6.0 x 

20 sodium chloride/sodium citrate (SSC) at about 45X, followed by a wash of 2.0 x SSC at 
50X, are known to Aose skilled in the art or can be found in Current Protocols in Molecular 
Biology, John WUey & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, the salt concentration in 
the wash step can be selected from a low stringency of about 2.0 x SSC at 50*'C to a high 
stringency of about 0.2 x SSC at 50**C. In addition, the temperature in the wash step can be 

25 increased from low stringency conditions at room temperature, about 22X, to high 
stringency conditions at about 65*'C. 

Isolated nucleic acids which differ from the nucleotide sequences shown in one of 
SEQ ID Nos. 1, 3, 5 or 7 due to degeneracy in the genetic code are also within the scope of 
the invention. For example, a number of amino acids are designated by more than one triplet. 

30 Codons that specify the same amino acid, or synonyms (for example, CAU and CAC are 
synonyms for histidine) may result m "silent" mutations which do not affect the amino acid 
sequence of the protein. However, it is expected that DNA sequence polymorphisms that do 
lead to changes in the amino acid sequences of the subject CCR-proteins will exist among 
eukaryotic cells. One skilled m the art will appreciate fliat these variations in one or more 

35 nucleotides (up to about 3-4% of the nucleotides) of the nucleic acids encodmg a particular 
member of the CCR-protein family may exist among individuals of a given species due to 
natural allelic variation. Any and all such nucleotide variations and resulting amino acid 
polymorphisms are within the scope of this invention. 
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Fragments of the nucleic acid encoding a biologically active portion of the subject 
CCR-proteins are also within the scope of the invention. As used herein, a fragment of the 
nucleic acid encoding an active portion of a CCR-protein refers to a nucleotide sequence 
having fewer nucleotides than the nucleotide sequence encoding the full length amino acid 
5 sequence of, for example, the CCR-proteins represented in SEQ ID Nos. 2, 4, 6 or 8, and 
which encodes a peptide which retains at least a portion of the biological activity of the full- 
length protein (i.e., a peptide capable of binding a CDK) as defmed herein, or alternatively, 
which is functional as an antagonist of the biological activity of the full-length protein. 
Nucleic acid fragments within the scope of the invention include those capable of hybridizing 

10 under high or low stringency conditions with nucleic acids from other species, e.g. for use in 
screening protocols to detect homologs of the subject CCR-proteins. Nucleic acids within 
the scope of the invention may also contain linker sequences, modified restriction 
endonuclease sites and other sequences useful for molecular cloning, expression or 
purification of such recombinant peptides. 

15 As indicated by the examples set out below, a nucleic acid encoding a peptide having 

an activity of a CCR-protein may be obtained from mRNA or genomic DNA present in any 
of a number of eukaryotic cells in accordance with protocols described herein, as well as 
those generally known to those skilled in the art. A cDNA encoding a CCR-protein, for 
example, can be obtained by isolating total mRNA from a cell, e.g. a mammalian cell, e.g. a 

20 human cell. Double stranded cDNAs can then be prepared from the total mRNA, and 
subsequently inserted into a suitable plasmid or bacteriophage vector using any one of a 
number of known techniques. A gene encoding a CCR-protein can also be cloned using 
established polymerase chain reaction techniques in accordance with the nucleotide sequence 
information provided by the invention. Alternatively, the two hybrid assay described in the 

25 appended examples can be used to identify other pi 6 homologs, e.g. which have ankyrin-like 
repeats and bind to a cyclin dependent kinase, such as CDK6. 

A preferred nucleic acid is a cDNA homologous to a sequence shown in SEQ ID No. 
1. Another preferred nucleic acid is a cDNA including a sequence shown in SEQ ID No. 3. 
Yet another preferred nucleic acid is a cDNA including a sequence shown in SEQ ID No. 5. 

30 Another aspect of the invention relates to the use of the isolated nucleic acid in 

"antisense" therapy. As used herein, "antisense" therapy refers to administration or in situ 
generation of oligonucleotide probes or their derivatives which specifically hybridizes (e.g. 
binds) under cellular conditions, with the cellular mRNA and/or genomic DNA encoding a 
CCR-protein so as to mhibit expression of that protein, e.g. by mhibiting transcription and/or 

35 translation. The bmding may be by conventional base pair complementarity, or, for example, 
in the case of binding to DNA duplexes, through specific interactions in the major groove of 
the double helbc. In general, "antisense" therapy refers to the range of techniques generally 
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employed in the art, and includes any therapy which relies on specific binding to 
oligonucleotide sequences. 

An antisense construct of the present invention can be delivered, for example, as an 
expression plasmid which, when transcribed in the cell, produces RNA which is 
complementary to at least a unique portion of the cellular mRNA which encodes a CCR- 
protein. Alternatively, the antisense construct is an oligonucleotide probe which is generated 
ex vivo and which, when introduced into the cell causes inhibition of expression by 
Iqrbridizing with the mRNA and/or genomic sequences encoding one of the subject CCR 
proteins. Such oligonucleotide probes are preferably modified oligonucleotide which are 
resistant to mdogenous nucleases, e.g, exonucleases and/or endonucleases, and is therefore 
stable in vivo, ^emplaiy nucleic acid molecules for use as antisense oligonucleotides arc 
phosphoramidate, phosphotfaioate and methylphosphonate analogs of DNA (see also U.S. 
Patents 5,176,996; 5,264,564; and 5,256,775). Additionally, general ^>proaches to 
constructing oligomers useful in antisense therapy have been reviewed, for example, by van 
der krol et al. (1988) Biotechnigues 6:958-976; and Stein et al. (1988) Cancer Res 48:2659- 
2668. 

Accordingly, the modified oligomers of the invention are useful in therapeutic^ 
diagnostic, and research contexts. In therapeutic ^plications, the oligomers are utilized in a 
manner appropriate for antisense therapy in general. For such therapy, the oligomers of the 
invention can be formulated for a variety of modes of administration, including systemic and 
topical or localized administration. Techniques and formulations generally may be found in 
Remminpton's Pharmaceutical ficimpj^ Meade Publishing Co., Easton, PA. For systemic 
administration, injection is preferred, including intramuscular, intravenous, intraperitoneal, 
and subcutaneuos for injection, the oligomers of the invention can be formulated m Uquid 
solutions, preferably in physiologically compatible buffers such as Hank's solution or 
Ringer's solution. In addition, the oligomers may be formulated in solid form and 
redissolved or suspended immediately prior to use. Lyophilized forms are also included. 

Systemic administration can also be by transmucosal or transdermal means, or the 
compounds can be administered orally. For transmucosal or transdermal administration, 
penetrants appropriate to the banier to be permeated are used in the fomulation. Such 
penetrants are generaUy known in the art, and include, for example, for transmucosal 
administration bile salts and fiisidic acid derivatives. In addition, detergents may be used to 
fedlitate pomeation. Transmucosal administration may be through nasal qways or usmg 
suppositories. For oral administration, the oligomers are formulated into conventional oral 
administration forms such as capsules, tablets, and tonics. For topical administration, Ae 
oligomers of the invention are formulated into ointments, salves, gels, or creams as generally 
known in the art. 
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Preferred antisense oligonucleotides contain all or a unique fiagment of a CCR gene. 
The fragment can be, for example, 10, 20, 50, 75 or 100 nucleotides in length. The 
uniqueness of the sequence selected can be relative to all other non-CCR genes, e.g., the 
antisense nucleic acid will hybridizes under physiological conditions to several CCR 
5 genes/messages, or the construct can be selected to uniquely hybridizes to only one or a 
specific subset of CCR genes, e.g., by chosing a sequence unique even among the various 
CCR genes. 

In addition to use in therapy, the oligomers of the invention may be used as diagnostic 
reagents to detect the presence or absence of the target DNA or RNA sequences to which 

10 they specifically bmd. Such diagnostic tests are described in fiuther detail below. 

This invention also provides e>q)ression vectors comprising a nucleotide sequence 
encoding a subject cell-cycle regulatory protem and operably linked to at least one regulatory 
sequence. Operably linked is intended to mean that the nucleotide sequence is linked to a 
regulatory sequence m a manner which allows expression of the nucleotide sequence. 

15 Regulatory sequences are art-recognized and are selected to direct expression of the peptide 
having an activity of a CCR-protein. Accordingly, the term regulatory sequence includes 
promoters, enhancers and other expression control elements. Exemplary regulatory 
sequences are described in Goeddel; Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, CA (1990). For instance, any of a wide variety of 

20 compression control sequences-sequences that control the expression of a DNA sequence when 
operatively linked to it may be used in these vectors to express DNA sequences encoding the 
CCR-proteins of this invention. Such usefiil expression control sequences, mclude, for 
example, the early and late promoters of SV40, adenovirus or cytomegalovirus immediate 
early promoter, the lac system, the trp qrstem, the TAC or TRC system, T7 promoter whose 

25 expression is directed by T7 RNA polymerase, the major operator and promoter regions of 
phage lambda , the control regions for fd coat protein, the promoter for 3-phosphoglycerate 
kinase or odier glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the 
promoters of the yeast a-mating factors, the polyhedron promoter of the baculovirus system 
and other sequences known to control the expression of genes of prokaryotic or eukaryotic 

30 cells or their viruses, and various combmations thereof. It should be understood that the 
design of the expression vector may depend on such factors as the choice of the host cell to 
be transformed and/or the Xy^ of protein desired to be expressed. Moreover, the vector's 
copy number, the ability to control that copy number and the expression of any other proteins 
encoded by the vector, such as antibiotic markers, should also be considered. 

35 As will be apparent, the subject gene constructs can be used to cause e3q)ression of the 

subject CCR-proteins in cells propagated in culture, e.g. to produce proteins or peptides, 
including fusion proteins or peptides, for purification. In addition, recombmant expression of 
the subject CCR-proteins, or agonist forms thereof, in cultured cells can be usefiil for 
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preventing de-differmtiation of cells mviYra. To illustrate, in vi/ro neuronal culture systems 
have proved to be fundamental and indispensable tools for the study of neural development, 
as well as the identification of neurotrophic factors. Once a neuronal cell has become 
terminally-differentiated, it typically will not change to another terminally diflFerentiated cell- 
5 type. However, neuronal cells can nevertheless readily lose their differentiated state. This is 
commonly observed when they are grown in culture from adult tissue, and when they form a 
blastema during regeneration. 

Agonizing the function of the subject CCR-proteins, such as by maintaining 
expression (or overexpression) of pl6 or pi 5, provides a means for ensuring an adequately 

10 restrictive environment in order to maintain neuronal cells at various stages of differentiation, 
and can be employed, for instance, in cell cultures designed to test the specific activities of 
trophic factors. Other tissue culture systems which require maintenance of differentiation 
will be readily apparent to those skilled in the art In this respect, each of the agonist and 
antagonist of CCR mhibition of CDK4 and CDK6 can be used for ex vivo tissue generation, 

15 as for example, to enhance the generation of prosthetic cartilage devices for implantation. 

Conversely, by antagonizmg the activity of the wild-type CCR-proteins, such as by 
expression of antagonistic homologs, antisense constructs, or treatment with agents able to 
disrupt binding of a CCR-protein with a CDK, the cultured cells can be prevented fi-om 
following certain differentiative pathways, and, importantly, can cause transformation of 

20 cells in culture. In similar fashion, the dominant negative CDK4 and CDK6 mutants 
described below can be used to cause cellular transformation, as each of these CDK mutants 
is insensitive to pl6 inhibition and is effectively an antagonist of pi 6. The ability of CCR 
antagonists to promote cell growth is particularly significant in light of the observation that 
human cells are notoriously difficult to grow in vitro. Accordingly, such reagents are 

25 therefore useful for transforming, and in certain instances, immortalizii^, cells from primary 
cell cultures. 

Moreover, the subject gene constructs can also be utilized in diagnostic assays to 
determine if a cell's growth is no longer dependent on the regulatory function of a CCR- 
protein, e.g, m determining the phenotype of a transformed cell. To illustrate, a sample of 

30 cells &om the tissue can be obtained from a patient and dispersed in appropriate cell culture 
media, a portion of the cells in the sample can be caused to e)q)ress a recombinant CCR- 
protein, e.g, by transfection with a pl6, pl5 or pl3.5 expression vector, and subsequent 
growth of the cells assessed. The ability of cells to prolifiaate despite expression of tfie 
CCR-protein is indicative of a lack of dependence on cell regulatory pathways which include 

35 the CCR-protein, e.g. RB-mediated checkpoints. Dependmg on the nature of the tissue of 
mterest, the sample can be m the form of cells isolated from, for example, a blood sample, an 
exfoliated cell sample, a fine needle aspirant sample, or a biopsied tissue sample. Where the 
initial sample is a solid mass, the tissue sample can be minced or otherwise dispersed so that 
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cells can be cultured, as is known in the art. Such knowledge can have both prognostic and 
therapeutic benefits. 

This invention also pertains to a host cell transfected with a recombuiant CCR-gene in 
order to express a polypeptide having an activity of a CCR-protein. The host cell may be any 
5 prokaryotic or eukaryotic cell. For example, a CCR-protein of the present invention may be 
expressed in bacterial cells such as E. colU insect cells (baculovirus), yeast, or mammalian 
cells. Other suitable host cells are known to those skilled in the art. 

Another aspect of the present invention concerns recombinant CCR-proteins which 
are encoded by genes derived from eukaryotic organisms, e.g. mammals, e,g. humans, and 

1 0 which have at least one biological activity of a CCR-protein, or which are naturally occurring 
mutants thereof The term "recombinant protein" refers to a protein of the present invention 
which is produced by recombinant DNA techniques, wherein generally DNA encoding the 
CCR-protein is inserted into a suitable expression vector which is in turn used to transform a 
host cell to produce the heterologous protein. Moreover, the phrase "derived from", with 

15 respect to a recombinant gene encoding the recombinant CCR-protein, is meant to include 
within the meaning of "recombinant protein" those proteins having an amino acid sequence 
of a native CCR-protein, or an amino acid sequence similar thereto which is generated by 
mutations mcluding substitutions and deletions of a naturally occurring CCR-protein of a 
organism. To illustrate, recombinant proteins preferred by the present invention, in addition 

20 to native pi 5 or pi 3. 5 proteins, are those recombinantiy produced proteins which are at least 
60% homologous, more preferably 70% homologous and most preferably 80% homologous 
with an amino acid sequence shown in SEQ ID No. 2, 4, 6 or 8. Polypeptides having an 
activity of a CCR- protein, such as CDK-buiding, and having at least about 90%. more 
preferably at least about 95%, and most preferably at least about 98-99% homology with a 

25 sequence shown in SEQ ID No. 2, 4, 6 or 8 are also witiun tiie scope of the invention. In 
other preferred embodiments, die recombinant CCR-protein can be characterized by ankyrin- 
like repeats and the ability to bind to a cyclin dependent kinase, such as CDK4 or CDK6. 
Thus, the present invention pertains to recombinant CCR-proteins which are encoded by 
genes derived from a organism and which have amino acid sequences evolutionarily related 

30 to a CCR-protein represented by one of ID No. 2, 4, 6 or 8, wherein "evolutionarily related 
to", refers to CCR-proteins having amino acid sequences which have arisen naturally (e.g. by 
allelic variance or by diflferential splicing), as well as mutational variants of CCR-proteins 
which are derived, for example, by combinatorial mutagenesis. 

The present invention further pertains to methods of producing the subject CCR- 

35 proteins. For example, a host cell transfected witii expression vector encoding one of the 
subject CCR-protein can be cultured under appropriate conditions to allow expression of the 
peptide to occur. The peptide may be secreted and isolated from a mixture of cells and 
medium contaimng tiie peptide. Alternatively, tiie peptide may be retained cytoplasmically 
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and the cells harvested, lysed and the protein isolated. A cell culture includes host cells, 
media and other byproducts. Suitable media for cell culture are well known in the art. The 
peptide can be isolated from cell culture medium, host cells, or both using techniques known 
in the art for purifying proteins including ion-exchange chromatography, gel filtration 
5 chromatography, ultrafiltration, electrophoresis, and immxmoaffinity purification with 
antibodies specific for particular epitopes of the subject CCR-proteins. In a preferred 
embodiment, the CCR-protein is a fusion protein containing a domain which facilitates its 
purification, such as pl6-GST, plS-GST, or a pl3.S-GST fusion proteins. 

Thus, a nucleotide sequence derived &om the cloning of a CCR-protein of the present 

10 invention, encoding all or a selected portion of the protein, can be used to produce a 
recombinant form of the protein via microbial or eukaryotic cellular processes. Ligating the 
polynucleotide sequence into a gene construct, such as an expression vector, and transforming 
or transfecting into hosts, either eukaryotic (yeast, avian, insect or mammalian) or 
prokaryotic (bacterial cells), are standard procedures used in producing other well-known 

15 protems, e.g. insulin, interferons, human growth hormone, IL-1, IL-2, and the like. Similar 
procedures, or modifications thereof, can be employed to prepare recombinant CCR-proteins, 
or portions thereof, by microbial means or tissue-culture technology in accord with the 
subject invention. 

The recombinant CCR-protein can be produced by ligating the cloned gene, or a 

20 portion thereof, into a vector suitable for e?q)ression in either prokaryotic cells, eukaryotic 
cells, or both. Expression vehicles for production of a recombinant CCR-protein include 
plasmids and other vectors. For instance, suitable vectors for the expression of the subject 
polypeptides include plasmids of the types: pBR322*derived plasmids, pEMBL-derived 
plasmids, pEX-derived plasmids, pBTac-derived plasmids and pUC-derived plasmids for 

25 expression in prokaryotic cells, such as E. colu 

A number of vectors exist for the expression of recombinant proteins in yeast. For 
instance, YEP24, YIPS, YEP51, YEP52, pYES2, and YRP17 are clomng and expression 
vehicles useful in the introduction of genetic constructs into 5. cerevisiae (see, for example. 
Broach et al (1983) in Experimental Manipulation of Gene Expression^ ed. M. Inouye 

30 Academic Ptess, p. 83, incorporated by reference herein). These vectors can replicate in E. 
call due the presence of the pBR322 ori, and in 5. cerevisiae due to the replication 
•determinant of tiie yeast 2 micron plasmid. In addition, drug resistance markers such as 
ampicillin can be used. 

The preferred mammalian expression vectors contain both prokaryotic sequences to 

35 facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription 
units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, 
pSV2gpt, pSV2neo, pSV2-dhfi', pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived 
vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic 
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cells. Some of these vectors are modified with sequences from bacterial plasmids, such as 
pBR322, to facilitate replication and drug resistance selection in both prokaryotic and 
eukaryotic cells. Altematively, derivatives of viruses such as the bovine papilloma virus 
(BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient 
5 expression of proteins in eukaryotic cells. Examples of other viral (including retroviral) 
expression systems can be found below in the description of gene therapy delivery systems. 
The various methods employed in the preparation of the plasmids and transformation of host 
organisms are well known in the art. For other suitable expression systems for both 
prokaryotic and eukaryotic cells, as well as general recombinant procedures, see Molecular 

10 Cloning A Laboratory Manual^ 2nd Ed., ed. by Sambrook, Fritsch and Maniatis (Cold Spring 
Harbor Laboratory Press: 1989) Chapters 16 ano 17. In some instances, it may be desirable to 
express the recombinant CCR-protein by the use of a baculovirus expression system. 
Examples of such baculovirus expressior systems include pVL-derived vectors (such as 
pVL1392, pVL1393 and pVL941), pAcUW-derived vectors (such as pAcUWl), and 

15 pBlueBac-derived vectors (such as the B-gal containing pBlueBac III). 

When expression of a carboxy terminal fragment of the full-length CCR-protein is 
desired, i.e. a truncation mutant, it may be necessary to add a start codon (ATG) to the 
oligonucleotide fragment containing the desired sequence to be expressed. It is well known 
in the art that a methionine at the N-terminal position can be en2ymaticaily cleaved by the use 

20 of the enzyme methionine aminopeptidase (MAP). MAP has been cloned from E. coli (Ben- 
Bassat et al. (1987) J. Bacteriol 169:751-757) and Salmonella typhimurium and its in vitro 
activity has been demonstrated on recombinant proteins (Miller et al. (1987) PNAS ^^:2718- 
1722). Therefore, removal of an N-tenninal methionine, if desired, can be achieved either in 
vivo by expressing such recombinant polypeptides in a host which produces MAP (e.g., E. 

25 coli or CM89 or S. cerevisiae), or in vitro by us i; of purified MAP (e.g., procedure of Miller et 
al.). 

Alternatively, the coding sequences for the polypeptide can be incorporated as a part 
of a fiision gene including a nucleotide sequence encoding a different polypeptide. This type 
of expression system can be usefiil under conditions where it is desirable to produce an 

30 immunogenic fragment of one of the subject CCR-proteins. For example, the VP6 capsid 
protein of rotavirus can be used as an immunologic carrier protein for portions of polypeptide, 
either in the monomeric form or in the form of a viral particle. The nucleic acid sequences 
corresponding to the portion of the CCR protein to which antibodies are to be raised can be 
incorporated into a fiision gene construct which includes coding sequences for a late vaccinia 

35 virus structural protein to produce a set of recombinant viruses expressing fiision proteins 
comprising a portion of the protein as part of the virion. The Hepatitis B surface antigen can 
also be utilized in this role as well. Similarly, chimeric constructs coding for fiision proteins 
containing a portion of a CCR-protein and the poliovirus capsid protein can be created to 
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enhance immunogenidty (see, for example, EP Publication No. 0259149; and Evans et al. 
(1989) Nature 339:385; Huang et al. (1988) J. Virol 62:3855; and Schlienger et al. (1992) 
J. Virol 66:2). 

The Multiple Antigen Peptide system for peptide-based immunization can be utilized, 
wherein, a desired portion of a CCR-protein is obtained directly from oigano-chemical 
synthesis of the peptide onto an oligomeric branching lysine core (see, for example, Posnett et 
al. (1988) JBC 263:1719 and Nardelli et al. (1992) J. Immunol 148:914). Antigenic 
determinants of the CCR-proteins can also be expressed and presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
q>preciated that fusion proteins can also fridlitate the repression of proteins. For example, 
the CCR-protein of 4e present invention can be generated as a glutathione-S-transferase 
(GST) fusion proteins. Such GST fusion proteins can be used to simply purification of the 
CCR-protein, such as Hirough the use of glutathione-derivatized matrices (see, for example. 
Current Protocols in Molecular Biology, eds. Ausabel et al. (N.Y.: John Wiley & Sons, 
1991)). 

In another embodiment, a fusion gene coding for a purification leader sequence, such 
as a poly-(His)/enterokinase cleavage site sequence at the N-terminus of the desired portion 
of the recombinant protein, can allow purification of the expressed fusion protein by afFmity 
chromatography using a Ni2+ metal resin. The purification leader sequence can then be 
subsequently removed by treatmrait with enterokinase to provide the purified CCR-protein 
(e.g., see Hochuli et al. (1987) J. Chromatography 411:177; and Janknecht et al. PNAS 
88:8972). 

Tedmiques for making fusion genes are well known. Essentially, the joining of 
various DNA fragments coding for diflferent polypeptide sequences is performed in 
accordance with conventional techniques, anploying blunt-ended or stagger-ended termini 
for ligation, restriction enzyme digestion to provide for ^propriate termini, filling-in of 
cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, 
and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PGR 
anqplification of gene fragments can be carried out using anchor primers which give rise to 
complementary oveihangs between two consecutive gene fragments which can subsequently 
be annealed to generate a chimeric gene sequence (see, for example. Current Protocols in 
Molecular Biology, eds. Ausubel et al. John Wiley & Sons: 1992). 

The present invention also makes available isolated and/or purified forms of the 
subject CCR-proteins, which are isolated from, or otherwise substantially free of other 
extracellular proteins, especially cell-cycle proteins, e.g. CDKs, cyclins, p21, pl9, or PCNA, 
normally associated with the CCR-protein. The term "substantially free of other cellular 
proteins" (also referred to herein as "contaminating proteins") is defined as encompassing, for 
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example, pl6, plS or plS.S preparations comprising less than 20% (by diy weight) 
contaminating protein, and preferably comprises less than 5% contaminating protein. 
Functional forms of the CCR-proteins can be prepared, for the first time, as purified 
preparations by using a cloned gene as described herein. By "purified", it is meant, when 
5 referring to a polypeptide, that the indicated molecule is present in the substantial absence of 
other biological macromolecules, such as other proteins (particularly other cell-cycle proteins 
such as CDK4 or CDK6, as well as other contaminating proteins). The term "piirified" as 
used herein preferably means at least 80% by dry weight, more preferably in the range of 95- 
99% by weight, and most preferably at least 99.8% by weight, of biological macromolecules 

10 of the same type present (but water, buffers, and other small molecules, especially molecules 
having a molecular weight of less than 5000, can be present). The term "pure" as used herein 
preferably has the same numerical limits as "purified" immediately above. "Isolated" and 
"purified" do not encompass either natural materials in their native state or natural materials 
that have been separated into components (e.g., in an acrylamide gel) but not obtained either 

15 as pure (e.g. lacking contaminating proteins, or chromatogr^hy reagents such as denaturing 
agents and polymers, e.g. acrylamide or agarose) substances or solutions. 

However, the subject polypeptides can also be provided in pharmaceutically 
acceptable carriers for formulated for a variety of modes of administration, including 
systemic and topical or localized administration. Techniques and formulations generally may 

20 be found in Remmington's Pharmaceutical Sciences. Meade Publishing Cn ^ Fagfnn, PA In 
an exemplary embodiment, the CCR-polypeptide is proAdded for transmucosal or transdermal 
delivery. For such administration, penetrants appropriate to the barrier to be permeated aie 
used in the formulation with the CCR polypeptide. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration bile salts and fiisidic acid 

25 derivatives. In addition, detergents may be used to facilitate permeation. Transmucosal 
administration may be through nasal sprays or using suppositories. For topical 
administration, the oligomers of the invention are formulated into ointments, salves, gels, or 
creams as generally known in the art. 

Another aspect of the invention related to peptides derived from the fiill-length CCR 

30 proteins. Isolated peptidyl portions of the subject CCR-proteins can be obtained by screening 
peptides recombinantly produced fit)m the corresponding fi:agment of the nucleic acid 
encoding such peptides. In addition, fragments can be chemically synthesized using 
techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc 
chemistry. For example, a CCR-protem of the present invention may be arbitrarily divided 

35 into fragments of desired length with no overlap of the fragments, or preferably divided into 
overiapping fragments of a desired length. The fragments can be produced (recombinantly or 
by chemical synthesis) and tested to identify those peptidyl fragments which can fijnction as 
either agonists or antagonists of, for example, CDK4 activation, such as by microinjection 



wo 95/28483 PCTAJS95/04636 

-27- 

assays. In an illustrative embodiment, peptidyl portions of the subject CCR proteins can 
tested for CDK-binding activity, as well as inhibitoiy ability, by e3q}ression as, for example, 
thioredoxin fusion proteins each of which contains a discrete fragment of the CCR-protein 
(see, for example, U-S. Patents 5,270,181 and 5,292,646; and PCT publication W094/ 
5 02502). 

It is also possible to modify the structure of the subject CCR-proteins for such 
purposes as enhancing therapeutic or prophylactic eflRcacy, or stability (e.g., ex vivo shelf life 
and resistance to proteolytic degradation in vivo). Such modified polypeptides, when 
designed to retain at least one activity of the naturally-occurring form of the protein, arc 

10 considered functional equivalents of the cell-cycle regulatory proteins described in more 
detail herein. Such modified polypeptides can be produced, for instance, by amino acid 
substitution, deletion, or addition. 

Moreover, it is reasonable to expect, for example, that an isolated replacement of a 
leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, 

15 or a similar replacement of an amino acid with a structurally related amino acid (i.e, 
conservative mutations) will not have a major effect on the biological activity of the resulting 
molecule. Conservative replacements are those that take place within a family of amino acids 
that are related in their side chains. Genetically encoded amino acids are can be divided into 
four fEunilies: (1) acidic = aspartate, glutamate; (2) basic = lysine, arginine, histidine; (3) 

20 nonpolar = alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan; and (4) uncharged polar = glycine, asparagine, glutamine, cysteme, serine, 
threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine arc sometimes classified jomtly 
as aromatic amino acids. In similar fashion, the amino acid repertoire can be grouped as (1) 
acidic = aspartate, glutamate; (2) basic = lysine, arginine histidine, (3) aliphatic = glycine, 

25 alanine, valine, leucine, isoleucine, serine, threonine, with serine and threonine optionally be 
grouped separately as aliphatic-hydroxyl; (4) aromatic = phenylalanine, tyrosine, tryptophan; 
(5) amide = asparagine, glutamine; and (6) sulfur -containing = cysteine and methionine, 
(see, for example. Biochemistry, 2nd ed, Ed. by L. Stiyer, WH Freeman and Co.:1981). 
Whether a change in the amino acid sequence of a peptide results in a fimctional homolog can 

30 be readily determined by assessing the ability of the variant peptide to produce a response in 
cells in a fiishion similar to the wild-type protem. For instance, such variant forms of pl6 can 
be assessed for their ability to complement a pl6-deficient cell. Peptides in which more than 
one replacement has taken place can really be tested in the same manner. 

This invention further contemplates a method of generating sets of combinatorial 

35 mutants of the present CCR-proteins, as well as truncation mutants, and is especially useful 
for identifying potential variant sequences (e.g. homologs) that are functional in binding to a 
CDK, especially CDK4 or CDK6. The purpose of screening such combinatorial libraries is 
to generate, for example, novel pl6, pl5 or pl3.5 homologs which can act as either agonists 
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or antagonist, or alternatively, possess novel activities all together. To illustrate, pl6 and/or 
pi 5 homologs can be engineered by the present method to provide more efficient binding to 
CDK4, yet have a significantly reduced binding affinity for CDK6 relative to the naturally- 
occurring form of the protein. Thus, combinatorially-derived homologs can be generated 
5 which have a selective potency relative to a naturally occurring CCR-protein. Such proteins, 
when expressed from recombinant DNA constructs, can be used in gene therapy protocols. 

Likewise, mutagenesis can give rise to CCR homologs which have intracellular half- 
lives dramatically different than the corresponding wild-type protein: For example, the 
altered protein can be rendered either more stable or less stable to proteolytic degradation or 

10 other cellular process which result in destruction of, or otherwise inactivation of, theCCR- 
protein. Such homologs, and the genes which encode them, can be utilized to alter the 
envelope of pi 6, pi 5 or pl3,5 expression by modulating the half-life of the protein. For 
instance, a short half-life can give rise to more transient biological effects and, when part of 
an inducible expression system, can allow tighter control of recombinant CCR-protein levels 

15 within the cell. As above, such proteins, and particularly their recombinant nucleic acid 
constructs, can be used in gene therapy protocols. 

In similar fashion, CCR homologs can be generated by the present combinatorial 
approach to act as antagonists, in that they are able to interfere with the abiUty of the 
corresponding wild-type protein to regulate cell proliferation. 

20 In a representative embodiment of this method, the amino acid sequences for a 

population of CCR-protein homologs are aligned, preferably to promote the highest 
homology possible. Such a population of variants can include, for example, pi 6 and/or plS 
homologs from one or more species, or homologs from the same species but which differ due 
to mutation. Amino acids which appear at each position of the aligned sequences are selected 

25 to create a degenerate set of combinatorial sequences. The presence or absence of amino 
acids from an aligned sequence of a particular variant is relative to a chosen consensus length 
of a reference sequence, which can be real or artificial. In order to maintain the highest 
homology in alignment of sequences, deletions in the sequence of a variant relative to the 
reference sequence can be represented by an amino acid space (♦), while insertional 

30 mutations in the variant relative to the reference sequence can be disregarded and left out of 
the sequence of the variant when aligned. 

To illustrate, upon mspection of the pl6, pl5 and pl3.5 sequences (see Figure 6), it 
was noted that an internal fragment of the three CCR-proteins was highly conserved. Based 
on these alignments, combinatorial libraries can be generated from this portion, the members 

35 of which can be expressed in the absence of other portions of a CCR-protein, or as a part of a 
CCR-protein in which other portions of the protein are static (e.g. a pi 6 or pi 5 protein in 
which only residues Met-52 to Gly-135 or Met-54 to Gly-137, respectively, are varied by 
combinatorial mutagenesis). For instance, a library of CCR homologs can be generated based 
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on the sequence of the human pi 6 and plS proteins so as to have an amino acid sequence 
represented by the degenerate formula: 

Met-Met-Met-Gly-Xaa(l)-Xaa(2)-Xaa(3)-Val-Ala-Xaa(4)-Leu-Leu-Leu-Xaa(5)- 
5 Xaa(6)-Gly-Ala-Xaa(7)-Xaa(8)-Asn-Cys-Xaa(9>Asp-Pro-Xaa(10)-Thr-Xaa(ll)- 
Xaa(12)-Xaa(13)-Arg-Pro-Val-His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr- 
Leu-Val-Val-Leu-His-Xaa(l 4)-Xaa(l 5)-Gly-Ala-Arg-Leu-Asp-Val-Arg.Asp-Ala- 
Trp-Gly-Arg-Leu-Pro-Xaa( 1 6)-Asp-Leu.Ala-Xaa(l 7)-01u-Xaa(l 8)-Gly-His- 
Xaa(19)-Asp-Xaa(20>Xaa(21)-Xaa(22)-Tyr-Leu-Arg-Xaa(23>Ala-Xaa(24>Gly 

10 

wherein each of Xaa(l)-Xaa(24) is selected from one of the amino acid residues of the same 
position in SEQ ID No. 2, 4, 6 or 8. 

Further expansion of the combinatorial library can be made by, for example, by 
including amino acids which would represent conservative mutations at one or more of the 

15 degenerate positions. Inclusion of such conservative mutations can give rise to a library of 
potential cell-cycle regulatory sequences represented by the above formula, but wherein 
Xaa(l) represents Ser, Thr, Asn or Gin; Xaa(2) represents Gly, Ala, Val, Leu, or He; Xaa(3) 
represents Arg, Lys or His; Xaa(4) represents Gly, Ala, Val, Leu, He, Asp or Glu; Xaa(5) 
represents Gly, Ala, Val, Leu, He, Asn or Gb; Xaa(6) represents Arg, Lys, His, Tyr or Phe; 

20 Xaa(7) represents Asp or Glu; Xaa(8) represents Pro, Gly, Ser or Thr; Xaa(9) represents Gly, 
Ala, Val, Leu, He, Asp or Glu; Xaa(10) represents Gly, Ala, Val, Leu, He, or an amino acid 
gap; Xaa(l 1) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(12) represents Phe, Tyr, Trp 
or an amino acid gap; Xaa(13) represents Ser or Thr; Xaa(14) represents Gly, Ala, Val, Leu, 
He, Arg, Lys or His; Xaa(15) represents Gly, Ala, Val, Leu, He, Ser or Thr; Xaa(16) 

25 rq)resents Gly, Ala, Val, Leu or He; Xaa(17) represents Glx; Xaa(18) represents Gly, Ala, 
Val, Leu, He, Lys, His or Arg; Xaa(19) represents Arg or Gin; Xaa(20) represents Gly, Ala, 
Val, Leu or He; Xaa(21) represents Gly, Ala, Val, Leu or He; Xaa(22) represents Gly, Ala, 
Val, Leu, He, Lys, His or Arg; Xaa(23) represents Gly, Ala, Val, Leu, He, Thr or Ser; Xaa(24) 
represents Gly, Ala, Val, Leu, He, Ser, Thr or an amino acid gap, where m this context, an 

30 amino acid gap is understood to mean the deletion of that amino acid position jfrom the 
polypeptide. Alternatively, amino acid replacement at degenerate positions can be based on 
steric criteria, e.g. isosteric replacement, without regard for polarity or charge of amino acid 
sidechains. Similarly, completely random mutagenesis of one or more of the variant 
positions pCaa) can be carried out. 

35 In a preferred embodiment, the combmatorial CCR library is produced by way of a 

degenerate library of genes encoding a library of polypeptides which each include at least a 
portion of potential CCR-protem sequences. For instance, a mixture of synthetic 
oligonucleotides can be enzymadcally ligated into gene sequences such that the degenerate 
set of potential CCR nucleotide sequences are expressible as individual polypeptides, or 
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alternatively, as a set of larger fusion proteins (e.g. for phage display) containing the set of 
CCR protein sequences therein. 

There are many ways by which the library of potential CCR homologs can be 
generated from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate 
5 gene sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes 
then be ligated into an appropriate gene for expression. The purpose of a degenerate set of 
genes is to provide, in one mixture, all of the sequences encoding the desired set of potential 
CCR sequences. The synthesis of degenerate oligonucleotides is well known in the art (see 
for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, 

10 Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273- 
289; Itakura et al. (1984) Annu. Rev, Biochem. 53:323; Itakura et al. (1984) Science 
198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1 :477. Such techniques have been employed 
in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 
249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249: 

15 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S. Patents Nos. 5,223,409^ 
5,198,346, and 5,096,815). 

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations, and, for that matter, for screening cDNA 
libraries for gene products having a certain property. Such techniques will be generally 

20 adaptable for rapid screening of the gene libraries generated by the combinatorial 
mutagenesis of CCR homologs. The most widely used techniques for screening large gene 
libraries typically comprises cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes imder conditions in which detection of a desired activity facilitates 

25 relatively easy isolation of the vector encoding the gene whose product was detected. Each of 
the illustrative assays described below are amenable to high through-put analysis as necessary 
to screen large numbers of degenerate sequences created by combinatorial mutagenesis 
techniques. 

In an illustrative embodiment of a screening assay, the candidate combinatorial gene 
30 products are displayed on the surface of a cell, and the ability of particular cells or viral 
particles to bmd a CDK, such as CDK4 or CDK6, via this gene product is detected in a 
spanning assay". For instance, the gene library can be cloned into the gene for a surface 
membrane protein of a bacterial cell (Ladner et al., WO 88/06630; Fuchs et al. (1991) 
Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140), and the resulting 
35 fusion protein detected by panning, e.g. using a fluorescently labeled molecule which binds 
the CCR-protein, e.g. FITC-CDK4, to score for potentially functional CCR homologs. Cells 
can be visually inspected and separated under a fluorescence microscope, or, where the 
morphology of the cell permits, separated by a fluorescence-activated cell sorter. 
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In similar fashion, the gene library can be expressed as a fusion protein on the surface 
of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences 
can be expressed on the surface of infectious phage, thereby conferring two significant 
benefits. First, since these phage can be applied to afiSnity matrices at very high 
5 concentrations, a large nimiber of phage can be screened at one time. Second, smce each 
infectious phage displays the combinatorial gene product on its surface, if a particular phage 
is recovered fix)m an affinity matrix in low yield, the phage can be amplified by another 
round of infection. The group of ahnost identical E. coli filamentous phages Ml 3, fd, and fl 
are most often used in phage display libraries, as either of the phage glll or gVIII coat 

10 proteins can be used to generate fusion proteins without disrupting the ultimate packaging of 
the viral particle (Ladner et al PCX publication WO 90/02909; Ganard et al., PCT 
publication WO 92/09690; Marks et al. (1992) J. Biol Chem. 267:16007-16010; Griffths et 
al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. 
(1992) PNAS 89:4457-4461). 

15 In an illustrative embodiment, the recombmant phage antibody system (RPAS, 

Pharmacia Catalog number 27-9400-01) can be easily modified for use in expressing and 
screening CCR combinatorial libraries of the present invention. For instance, the pCANTAB 
5 phagemid of the RPAS kit contains the gene which encodes the phage glll coat protein. 
The CCR combinatorial gene library can be cloned into the phagemid adjacent to the glll 

20 signal sequence such that it will be esxpx^sed as a glll fusion protein. After ligation, the 
phagemid is used to transform competent E. coli TGI cells. Transformed cells are 
subsequently infected with M13K07 helper phage to rescue the phagemid and its candidate 
CCR-gene insert. The resulting recombinant phage contain phagemid DNA encoding a 
specific candidate CCR-protein, and display one or more copies of the corresponding fusion 

25 coat protein. The phage-displayed candidate proteins which are capable of, for example, 
binding a CDK, are selected or enriched by panning. For instance, the phage library can be 
panned on glutathione immobilized CDK-GST fusion proteins, and unbound phage washed 
away fi-om the cells. The boxmd phage is then isolated, and if the recombinant phage express 
at least one copy of the wild type glll coat protein, they will retain their ability to infect £. 

30 coli. Thus, successive rounds of remfection of E, coliy and panning will greatly enrich for 
CCR homologs, e.g. pl6, pl5 or pl3.5 homologs, which can then be screened for further 
biological activities in order to differentiate agonists and antagonists. Subsequent selection, 
e.g. of a reduced set of variants firom the library, may then be based upon more meaningful 
criteria rather than simple CDK-bmding ability. For instance, intracellular half-life or 

35 inhibitory potency can become selection criteria in secondary screens. 

In light of the present disclosure, other forms of mutagenesis generally applicable will 
be apparent to those skilled in the art m addition to the aforementioned combinatorial 
mutagenesis. For example, pl6, pl5 or pl3.5 homologs (both agonist and antagonist forms, 
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and including fiagments) can be generated and screened using, for example, alanine scanning 
mutagenesis and the like (Ruf et al. (1994) Biochemistry 33:1565-1572; Wang et al. (1994) J. 
Biol. Chem. 269:3095-3099; Balint et al. (1993) Gene 137:109-118; Grodberg et al. (1993) 
Eur. J. Biochem. 218:597-601; Nagashima et al. (1993) J. Biol. Chem. 268:2888-2892; 
5 Lowman et al. (1991) Biochemistry 30:10832-10838; and Cunningham et al. (1989) Science 
244:1081-1085), by linker scanning mutagenesis (Gustin et al. (1993) Virology 193:653-660; 
Brown et al. (1992) Mol. Cell Biol. 12:2644-2652; McKnight et al. (1982) Science 232:316); 
by saturation mutagenesis (Meyers et al. (1986) Science 232:613); by PGR mutagenesis 
(Leung et al. (1989) Method Cell Mol Biol 1:11-19); or by random mutagenesis (Miller et al. 

10 (1992) A Short Course in Bacterial Genetics, CSHL Press, Cold Spring Harbor, NY; and 
GreOTer et al. (1994) Strategies in Mol Biol 7:32-34). 

Consequently, the invention also provides for reduction of the subject CCR-proteins 
to generate mimetics, e.g. pq)tide or non-peptide agents, which are able to mimic binding of 
the authentic CCR protein to a cyclin dq)endent kinase, e.g. CDK4 and/or CDK6. Such 

15 mutagenic techniques as described above, as well as the thioredoxin system, are also 
particularly useful for mapping the determinants of a CCR-protein vAsizYi participate in 
protein-protein interactions involved in, for example, binding of the subject CCR-protein to a 
CDK. To illustrate, the critical residues of a subject CCR-protein which are involved in 
molecular recognition of CDK4 can be determined and used to generate CCR-derived 

20 peptidomimetics which bind to CDK4 or CDK6 and, like the authentic CCR-protein, inhibit 
activation of the kinase. By employing, for exanq>le, scanning mutagenesis to map the amino 
acid residues of a particular CCR-protein involved in binding a cyclm dependent kinase, 
peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated 
which mimic those residues in binding to the kinase. For instance, non-hydrolyzable peptide 

25 analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in 
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988), azepine (e.g., see Hufifinan et al. in Peptides: Chemistry and Biology, 
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988). substituted gama lactam 
tings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 

30 Publisher: Leiden, Netherlands, 1988), keto-methylene pseudopeptides (Ewenson et al. 
(1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function 
(Proceedings of the 9d» American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 
1985), p-tum dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. 
(1986) J Chem Soc Perkin Trans 1:1231), and P-aminoalcohols (Gordon et al. (1985) 

35 Biochem Biophys Res Commun 126:419; and Dann et al. (1986) Biochem Biophys Res 
Commun U4 J\). 

In similar fashion, identification of mutations in CDK4 and/or CDK6 which effect 
binding to a CCR-protein can be used to identify potential peptidyl fiagments of 
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CDK4/CDK6 which can competitively bind a CCR-protein and interfere with its ability to 
inhibit the CDK. As describe below, we have charactwized mutations to CDK4 which 
abrogate binding by pl5 and pl6, and consequently provide kinases that are insensitive to 
inhibition by those CCR-proteins. These mutations, in fact, occur in stretches of amino acid 
5 residues which are conserved between CDK4 and CDK6. Accordingly, peptidomimetics 
based on these portions of CDK4/CDK6 might be useful as antagonists of CCR-proteins, in 
that they are expected to compete with CDK4 or CDK6 for binding to the CCR-protem. In a 
preferred embodimoit, the CCR antagonist is a peptide or non-peptide analog of the amino 
acid sequence VAEIG(V/E)GAYG(T/K)-V(FAOKARD, though more preferably a 

1 0 peptidomimetic of the amino acid sequence V(F/Y)KARD, and even more preferably of the 
tetrapeptide KARD. Likewise, another preferred peptide analog of CDK4/CDK6 comprises a 
peptide or peptidomimetic corresponding to *Jie sequence SRTDRE(I/l)K(V/L)TnLVFEHVD- 
QDL(Rn^TYLDK(AAOPPPG(LAO, though more preferably the peptidomimetic is an 
analog of the hexa-peptides FEHVDQ or EHv^QD, and even more preferably of Ae 

15 tetrapeptides HVDQ or EHVD. Non-hydrolyzable peptide analogs of such residues can be 
generated using, for example, benzodiazepine, azepine, substimted gama lactam rings, keto- 
methylene pseudopeptides, p-tum dipeptide cores, or p-aminoalcohols. These and other 
peptidyl portions of CDKs can be tested for binding to CCR-proteins such as pl5 or pl6 
using, for example, the tfaioredoxin fusion proteins constructs mention above. 

20 Another aspect of the invention pertams to an antibody specifically reactive with one 

of the subject CCR-proteins. For example, by usmg peptides based on the cDNA sequence 
of the subject pl6 protein, anti-pl6 antisera or anti-pl6 monoclonal antibodies can be made 
usmg standard methods. Likewise, anti-pl3.5 and anti-pl5 antibodies can be generated. A 
mammal such as a mouse, a hamster or rabbit can be immunized with an immunogenic form 

25 of the peptide (e.g., an antigenic fragment which is capable of eliciting an antibody response). 
Techniques for confening inmiunogemcity on a protem or peptide include conjugation to 
carriers or otiier techniques weU known in the art. For instance, a peptidyl portion of the 
protein represented by one of SEQ ID No. 2. 4, 6 or 8 can be administered in tiie presence of 
adjuvant. The progress of immunization can be monitored by detection of antibody titers in 

30 plasma or serum. Standard EUSA or other immunoassays can be used witii tiie immunogen 
as antigen to assess the levels of antibodies. 

FoUowmg immunization, anti-CCR antisera can be obtained and, if desired, 
polyclonal anti-CCR antibodies isolated from the serum. To produce monoclonal antibodies,' 
antibody producmg ceUs Oymphocytes) can be harvested from an immunized animal and 

35 fused by standard somatic cell fusion procedures wifli inmiortalizing cells such as myeloma 
cells to yield hybridoma ceUs. Such techniques are weU known in the art, an include, for 
example, flie hybridoma technique (originally developed by Kohler and Milstein, (1975) 
Nature, 256: 495-497), as the human B cell hybridoma technique (Kozbar et al,, (1983) 
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Immunology Today ^ 4: 72), and the EBV-hybridoma technique to produce hiiman monoclonal 
antibodies (Cole et al., (1985) Monoclonal Antibodies and Cancer Therapy^ Alan R. Liss, 
Inc. pp. 77-96). Hybridoma cells can be screened immunocheniically for production of 
antibodies specifically reactive with the CCR-protein of interest and the monoclonal 
5 antibodies isolated. 

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with a CCR-protein, e.g. anti-pl6, anti-pl5 or anti-pl3.5 antibodies. 
Antibodies can be fragmented using conventional techniques and the fragments screened for 
utility in the same manner as described above for whole antibodies. For example, F(ab*)2 

10 fragments can be generated by treating antibody with pepsin. The resulting F(ab*)2 fragment 
can be treated to reduce disulfide bridges to produce Fab' firagments. The antibody of the 
present invention is fiirther intended to include bispecific and chimeric molecules! 

Both monoclonal and polyclonal antibodies (Ab) directed against the subject CCR- 
proteins, and antibody fragments such as Fab' and F(ab')2, can be used to block the action of 

15 particular CCR and allow the study of the cell-cycle or cell proliferation. 

One application of anti-CCR antibodies is in the immunological screening of cDNA 
libraries constructed in expression vectors, such as Xgtll, Xgtl8-23, XZAP, and >.0RF8, 
Messenger libraries of this type, having coding sequences inserted in the correct reading 
fiiame and orientation, can produce fiision proteins. For instance, A^gtl 1 will produce fiision 

20 proteins whose amino tennini consist of fi-galactosidase amino acid sequences and whose 
carboxy tennini consist of a foreign polypeptide. Antigenic epitopes of a CCR-protein, such 
as proteins antigenically related to pl6, pl5 or pl3.5, can then be detected with antibodies, as, 
for example, reacting nitrocellulose filters lifted fit)m infected plates with an anti-CCR 
antibody. Phage, scored by this assay, can then be isolated from the infected plate. Thus, the 

25 presence of CCR homologs, such as pl6, pl5 or pl3.5 homologs, can be detected and cloned 
from other sources. 

Antibodies which are specifically immunoreactive with one or more CCR-proteins of 
the present invention can also be used in immunohistochemical staining of tissue samples in 
order to evaluate the abundance and pattern of expression of the CCR-protein family, or 

30 particular members thereof. Anti-CCR antibodies can be used diagnostically in immuno- 
precipitation and inununo-blotting to detect and evaluate levels of one or more CCR-proteins 
in tissue or cells isolated fix)m a bodily fluid as part of a clinical testmg procedure. For 
instance^ such measurements can be usefid in predictive valuations of the onset or 
progression of tumors. Likewise, the ability to monitor certain CCR-piotein levels in an 

35 individual can allow determination of the efiBcacy of a given treatment regimen for an 
individual afflicted with such a disorder. Diagnostic assays using anti-CCR antibodies, such 
as anti-pl6 or anti-pl5 antibodies, can include, for example, immimoassays designed to aid 
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in early diagnosis of a neoplastic or hyperplastic disorder, e.g. the presence of cancerous cells 
in the sample, e.g. to detect cells in which a lesion of a CCR-gene has occuned. 

In addition, nucleotide probes can be generated from the cloned sequence of the 
subject CCR-proteins, which allow for histological screening of intact tissue and tissue 
samples for the presence of a CCR-protein encoding mRNA. Similar to the diagnostic uses 
of anti-CCR-protein antibodies, the use of probes directed to CCR-protein encoding mRNAs, 
or to genomic CCR-gene sequences, can be used for both predictive and therapeutic 
evaluation of allelic mutations which might be manifest in, for example, neoplastic or 
hyperplastic disorders (e.g. unwanted cell growth). Used in conjunction with anti-CCR 
protein antibocfy unmunoassays, the nucleotide probes can help fecilitate the determination of. 
the molecular basis for a developmental disorder which may involve some abnormality 
associated with expression (or lack thereof) of a CCR-protein. For instance, variation in 
CCR-protein synthesis can be differentiated ftom a mutation in the coding sequence. 

Accordingly, the present method provides a method for determining if a subject is at 
risk for a disorder characterized by unwanted cell proliferation. In preferred embodiments, 
method can be generally characterized as comprising detecting, in a tissue of said subject, the 
presence or absence of a genetic lesion characterized by at least one of (i) a mutation of a 
gene encoding a CCR-protem, such as pi 6, pi 5 or pl3.5 or (ii) the mis-expression of the 
CCR-gene. To illustrate, such genetic lesions can be detected by ascertaining the existence of 
at least one of 0) a deletion of one or more nucleotides from a CCR-gene, (ii) an addition of 
one or more nucleotides to a CCR-gene, (iii) a substitution of one or more nucleotides of a 
CCR-gene, (iv) a gross diromosomal rearrangement of a CCR-gene, (v) a gross alteration in 
the level of a messenger RNA transcript of a CCR-gene, (vi) the presence of a non-wild type 
spUcing pattern of a messenger RNA transcript of a CCR-gene, and (vii) a non-wild type level 
of a CCR-protein. In one aspect of the mvention, there is provided a probe^primer comprismg 
an oligonucleotide containing a region of nucleotide sequence which is capable of hybridizing 
to a sense or antisense sequence of any of SEQ ID Nos: 1, 3, 5 or 7 or naturally occurring 
mutants thereof, or 5' or 3' flanking sequences or intronic sequences naturally associated with 
the subject CCR-genes or naturally occurring mutants thereof. The probe is exposed to 
nucleic acid of a tissue sample; and the hybridization of the probe to the sample nucleic acid 
is detected. In certain embodiments, detection of the lesion comprises utilizing the 
probe/primer in a polymerase chain reaction (PGR) (see, e.g. U.S. Patent Nos. 4,683,195 and 
4,683,202), or, alternatively, in a ligation chain reaction (LCR) (see, e.g., Landegran et al. 
(1988) Science 241:1077-1080; and Nakazawa et al. (1944) PNAS 91:360-364), the later of 
which can be particularly usefiil for detectmg point mutations in Ae CCR-gene. 
Alternatively, the level of CCR-protein can detected m an immunoassay. 

Moreover, the use of anti-sense techniques (e.g. micromjection of antisense 
molecules, or tiansfection with plasmids v/hose transcripts are anti-sense with regard to a 
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certain CCR mRNA) can be used to investigate role of particular CCR-proteins, e.g. pl6, pl5 
or pl3.S, in the cell-cycle and cell proliferation, in a controlled environment, by inhibiting 
endogenous production of the protein. Such techniques can be utilized in cell culture, but can 
also be used in the creation of transgenic animals. 
5 Another aspect of the invention features transgenic non-human animals which express 

a heterologous CCR-gene of the present invention, or which have had one or more genomic 
CCR-gene(s), disrupted in at least one of the tissue or cell-types of the animal. For instance, 
transgenic mice that are disrupted at their pi 3.5 gene locus are described in Example S. 

In another aspect, the invention features an animal model for developmental diseases, 
10 which has a CCR allele which is mis-expressed. For example, a mouse can be bred which 
has a pl6 or pl5 allele deleted, or in which all or part of one or more pl6 exons are deleted. 
Such a mouse model can then be used to study disorders arising from mis-e?qpressed pl6 
genes. 

Accordingly, the present invention concerns transgenic animals which are comprised 

15 of cells (of that animal) which contain a transgene of the present invention and which 
preferably (though optionally) express an exogenous CCR-protein in one or more cells in the 
animal. The CCR-transgene can encode the wild-type form of the protein, or can encode 
homologs thereof, including both agonists and antagonists, as well as antisense constructs. In 
preferred embodiments, the expression of the transgene is restricted to specific subsets of 

20 cells, tissues or developmental stages utilizing, for example, cis-acting sequences that control 
expression in the desired pattern. In the present invention, such mosaic expression of the 
subject proteins can be essential for many forms of lineage analysis and can additionally 
provide a means to assess the effects of, for example, lack of pl6 and pi 5 inhibition of CDKs 
which might grossly alter development m small patches of tissue within an otherwise normal 

25 embryo. Toward this and, tissue-spedUSc regulatory sequences and conditional regulatory 
sequences can be used to control expression of the transgene in certain spatial patterns. 
Moreover, temporal patterns of expression can be provided by, for example, conditional 
recombination systems or prokaryotic transcriptional regulatory sequences. 

Genetic techniques which allow for the expression of transgenes can be regulated via 

30 site-specific genetic manipulation in vivo are known to those skilled in the art. For instance, 
genetic systems are available which allow for the regulated expression of a recombmase that 
catalyzes the genetic recombination a target sequence. As used herein, the phrase "target 
sequence" refers to a nucleotide sequence that is genetically recombined by a recombinase. 
The target sequence is flanked by recombinase recognition sequences and is generally either 

35 excised or inverted in cells expressing recombinase activity. Recombinase catalyzed 
recombination events can be designed such that recombination of the target sequence results 
in either the activation or repression of expression of the subject CCR polypeptide. For 
example, excision of a target sequence which interferes with the expression of a recombinant 



wo 95/28483 PCT/US95/04636 

-37- 

CCR-gene can be designed to activate expression of that gene. This interference with 
expression of the protein can result from a variety of mechanisms, such as spatial separation 
of the CCR-gene from the promoter element or an internal stop codon. Moreover, the 
transgene can be made wherein the coding sequence of the gene is flanked recombinanse 
5 recognition sequences and is initially transfected into cells in a 3' to 5' orientation with 
respect to the promoter element. In such an instance, inversion of the target sequence will 
reorient the subject gene by placing the 5* end of the coding sequence in an orientation with 
respect to the promoter element which allow for promoter driven transcriptional activation. 

In an illustrative embodunent, either the crelloxP recombinase system of 

10 bacteriophage PI (Lakso et al. (1992) PNAS 89:6232-6236; Orban et al. (1992) PNAS 
89:6861-6865) or the FLP recombmase system of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to generate in 
vivo site-specific genetic recombination systems. Cre recombinase catalyzes the site-specific 
recombination of an intervening target sequence located between loxP sequences. loxP 

15 sequences are 34 base pair nucleotide repeat sequences to which the Cre recombinase binds 
and are required for Cre recombinase mediated genetic recombination. The orientation of 
loxP sequences determines whether the intervening target sequence is excised or inverted 
when Cre recombinase is present (Abremski et al. (1984) J. Biol. Chem, 259:1509-1514); 
catalyzing the excision of the target sequence when the loxP sequences are oriented as direct 

20 repeats and catalyzes inversion of the target sequence when loxP sequences are oriented as 
inverted repeats. 

Accordingly, genetic recombination of the target sequence is dependent on e)q)ression 
of the Cre recombmase. Expression of the recombinase can be regulated by promoter 
elements which are subject to regulatory control, e.g., tissue-specific, developmental stage- 

25 specific, inducible or repressible by externally added agents. This regulated control will 
result in genetic recombination of the target sequence only in cells where recombinase 
expression is mediated by the promoter element Thus, the activation expression of the CCR 
protein can be regulated via regulation of recombinase expression. 

Use of the crelloxP recombinase system to regulate expression of a recombinant CCR 

30 protein, such as pl5 or pl6, requires the construction of a transgenic animal containing 
transgenes encoding both the Cre recombinase and the subject protein. Animals containing 
both the Cre recombinase and the recombinant CCR genes can be provided through the 
construction of "double" transgenic animals. A convenient method for providing such 
animals is to mate two transgenic animals each containing a transgene, e.g., the CCR-gene 

35 and recombinase gene. 

One advantage derived from initially constructing transgenic animals containing a 
CCR-transgene in a recombinase-mediated expressible format derives fiom the likelihood 
that the subject protein will be deleterious upon expression in the transgenic animal. In such 
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an instance^ a founder population, in which the subject transgene is silent in all tissues, can be 
propagated and maintained. Individuals of this founder population can be crossed with 
animals expressing the recombinase in, for example, one or more tissues. Thus, the creation 
of a founder population in which, for example, an antagonistic pi 5 transgene is silent will 
5 allow the study of progeny from that founder in which disruption of cell cycle regulation by 
pi 5 in a particular tissue or at developmental stages would result in, for example, a lethal 
phenotype. 

Similar conditional transgenes can be provided using prokaiyotic promoter sequences 
which require prokaiyotic proteins to be simultaneous expressed in order to facilitate 

10 expression of the transgene. Exemplary promoters and the corresponding trans-activating 
prokaryotic proteins are given in U.S. Patent No. 4,833,080. Moreover, e3q)ression of the 
conditional transgenes can be induced by gene therapy-like methods wherein a grae encoding 
the trans-activating protem, e.g. a recombinase or a prokaiyotic protein, is delivered to the 
tissue and caused to be expressed, such as in a cell-type specific manner. By this method, the 

15 CCR-transgene could remain silent into adulthood until "turned on" by the mtroduction of the 
trans-activator. 

In an exemplary embodiment, the "transgenic non-human animals" of the invention 
are produced by introducing transgenes into the geimline of the non-human animal. 
Embryonal target cells at various developmental stages can be used to introduce transgenes. 

20 Diflfercnt methods are used depending on the stage of development of the embryonal target 
cell. The ^gote is the best target for micro-injection. In the mouse, flie male pronucleus 
reaches the size of approximately 20 micrometers in diameter which allows reproducible 
injection of l-2pl of DNA solution. The use of ^gotes as a target for gene transfer has a 
major advantage in that in most cases the mjected DNA will be incorporated into the host 

25 gene before the first cleavage (Brinster et al. (1 985) PNAS 82:4438-4442). As a consequence, 
all cells of the transgenic non-hmnan animal will carry the incorporated transgene. This will 
in general also be reflected in the efficient transmission of the transgene to offspring of the 
founder since 50% of the germ cells will harbor the transgene. Microinjection of zygotes is 
the preferred method for incorporating transgenes in practicing the invention. 

30 Retroviral infection can also be used to introduce transgene into a non-human animal. 

The developing non-human embryo can be cultured in vitro to the blastocyst stage. During 
this time, the blastomeres can be targets for retroviral infection (Jaenich, R. (1976) PNAS 
73:1260-1264). EflBcient infection of the blastomeres is obtamed by enzymatic treatment to 
remove the zona pellucida {Manipulating the Mouse Embryo, Hogan eds. (Cold Spring 

35 Harbor Laboratory Press, Cold Spring Harbor, 1986). The viral vector system used to 
introduce the transgene is typically a replication-defective retrovirus carrying the transgene 
(Jahner et al. (1985) PNAS 82:6927-6931; Van der Putten et al. (1985) PNAS 82:6148-6152). 
Transfection is easily and eflSciently obtained by culturing the blastomeres on a monolayer of 
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virus-producing cells (Van der Putten, svpra\ Stewart et al. (1987) EMBO J. 6:383-388). 
Alternatively, infection can be performed at a later stage. Virus or virus-producing cells can 
be injected into the blastocoele (Jahner et al. (1982) Nature 298:623-628). Most of the 
founders will be mosaic for the transgene since incorporation occurs only in a subset of the 
5 cells which formed the transgenic non-himian animal. Further, the founder may contain 
various retroviral insertions of the transgene at different positions in the genome which 
generally will segregate in the offspring. In addition, it is also possible to introduce 
transgenes into the germ line by intrauterine retroviral infection of the midgestation embryo 
(Jahner et al. (1982) supra). 

10 A third type of target cell for transgene introduction is the embryonal stem cell (ES). 

ES cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (Evans et al. (1981) Nature 292:154-156; Bradley et al. (1984) Nature 309:255-258; 
Gossler et al. (1986) PNAS 83: 9065-9069; and Robertson et al. (1986) Nature 322:445-448). 
Transgenes can be eflBciently introduced into the ES cells by DNA transfection or by 

15 retrovirus-mediated transduction. Such transformed ES cells can thereafter be combined with 
blastocysts from a non-human ammal. The ES cells thereafter colonize the embryo and 
contribute to the germ line of the resulting chimeric animal. For review see Jaenisch, R. 
(1988) Science 240:1468-1474. 

Methods of making knock-out or disruption transgenic animals are also generally 

20 known. See, for example, Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y., 1986). Recombinase dependent knockouts can also be 
generated, e.g. by homologous recombination to insert target sequences, such that tissue 
specific and/or temporal control of inactivation of a CCR-gene can be controlled as above. 

Yet another aspect of the invention pertains to methods of treating proliferative and/or 

25 differentiative disorders ^iiich arise from cells which, despite aberrant growth control, still 
require CDK4 or CDK6 for cell growth. There are a wide variety of pathological cell 
proliferative conditions for which the CCR-gene constructs and CCR-mimetics of the present 
invention can provide therapeutic benefits, with the general strategy being the inhibition of an 
anomalous cell proliferation. For instance, the gene constructs of the present invention can 

30 be used as a part of a gene therapy protocol, such as to reconstitute the function of a CCR- 
protein, e.g. pl6, pl5 or pl3.5, in a cell in which the protein is misexpressed or in which 
-signal transduction pathways upstream of the CCR-protein are dysfunctional. To illustrate, 
cell types which exhibit pathological or abnormal growth presumably dependent at least in 
part on a function of a CCR-protein include various cancers and leukemias, psoriasis, bone 

35 diseases, fibroproliferative disorders such as involving connective tissues, atherosclerosis and 
other smooth muscle proliferative disorders, as well as chronic inflammation. In addition to 
proliferative disorders, the treatment of differentiative disorders which result from, for 
example, de-differentiation of tissue which may (optionally) be accompanied by abortive 
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reentry into mitosis. Such degenerative disorders include chronic neurodegenerative diseases 
of the nervous system, including Alzheimer's disease, Parkinson's disease, Huntington's 
chorea, amylotrophic lateral sclerosis and the like, as well as spinocerebellar degenerations. 
Other differentiative disorders include, for example, disorders associated with connective 
5 tissue, such as may occur due to de-differentiation of chondrocjrtes or osteocytes, as well as 
vascular disorders which involve de-differentiation of endothelial tissue and smooth muscle 
cells, gastric ulcers characterized by degenerative changes in glandular cells, and renal 
conditions marked by failure to differentiate, e.g. Wilm's tumors. It will also be apparent 
that, by transient use of gene therapy constructs of the subject CCR proteins (e.g. agonist and 

10 antagonist forms), in vivo reformation of tissue can be accomplished, e.g. in the development 
and maintenance of organs. By controlling the proliferative and differentiative potential for 
different cells, the subject gene constructs can be used to reform injured tissue, or to improve 
grafting and morphology of transplanted tissue. For instance, CCR agonists and antagonists 
can be employed therapeutically to regulate organs after physical, chemical or pathological 

15 insult. For example, gene therapy can be utilized in liver repair subsequent to a partial 
hepatectomy, or to promote regeneration of lung tissue in the treatment of emphysema. 

For instance, as described in the Examples below, transformation of a cell can be due 
in part to a loss-of-function mutation to a particular CCR-gene, e.g., ranging from a point 
mutation to gross deletion of the gene. Additionally, other data provided in the appended 

20 examples suggests that disorders susceptible to treatment with CCR agonists include those 
arising from cells which have lost the abUity to induce CCR-protein expression. Normal cell 
proliferation, for instance, is generally marked by responsiveness to negative autocrine or 
paracrine growth regulators, such as members of the TGF-p family, e.g. TGF-pi, TGF-p2 or 
TGF-p3, and related polypeptide growth inhibitors, e.g. activins, inhibins, Mfillerian 

25 inhibiting substance, decapentaplegic, bone morphogenic factors, and vgl (e.g. terminal 
differentiation inducers). Ordinarily, control of cellular proliferation by such growth 
regulators, particularly in epithelial and hemopoietic cells, is in the form of growdi inhibition. 
This is generally accompanied by differentiation of the cell to a post-mitotic phenotype. 
However, it has been observed that a significant percentage of human cancers derived from 

30 these cells types display a reduced responsiveness to growtix regulators such as TGF-p. For 
instance, some tumors of colorectal, liver epithelial, and epidermal origin show reduced 
■sensitivity and resistance to the growth-inhibitory effects of TGF-p as compared to tiieir 
normal counterparts. In this context, a noteworthy characteristic of several retinoblastoma 
cell lines is the absence of detectable TGF-p receptors. Treatment of such tumors with CCR 

35 agonists (e.g. CCR-proteins delivered by gene therapy or CCR-mimetics) provides an 
opportunity to restore the function of the RB-mediated checkpoint 
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However, it will be appreciated that the subject method can be used to inhibit 
jMoliferation of cells which, in general, are still reliant on cyclin depaident kinases active in 
Oi, e.g. CDK4 or CDK6, irrespective of involvement of RB or RB-like proteins. 

In accordance with the subject meAod, expression constructs of the subject CCJl- 
5 proteins may be administered in any biologically effective carrier, c.g. any formulation or 
composition capable of effectively transfecting cells in vivo with a recombmant CCR-gene. 
Approaches include insertion of the subject gene in viral vectors including recombinant 
retroviruses, adenovirus, adeno-associated virus, and herpes simplex virus-1, or recombinant 
bacterial or eukaiyotic plasmids. Viral vectors can be used to transfect cells directly; 
10 plasmid DNA can be delivered with the help of, for example, cationic liposomes (lipofectin) 
or derivatized (e.g. antibody conjugated), polylysine conjugates, gramacidin S, artificial viral 
envelopes or other such intracellular carriers, as weU as direct injection of the gene construct 
or CaP04 precipitation earned out in vivo. It will be appreciated that because transduction of 
appropriate target ceUs represents the critical first step in gene therapy, choice of the 
1 5 particular gene delivery system will depend on such factors as the phenotype of the intended 
target and the route of administration, e.g. locally or systemically. 

A preferred approach for in vivo introduction of nucleic acid encoding one of the 
subject proteins into a cell is by use of a viral vector containing nucleic acid, e.g. a cDNA, 
encoding the gene product. Infection of ceUs with a viral vector has the advantage that a 
20 large proportion of the targeted cells can receive the nucleic acid. Additionally, molecules 
encoded within the viral vector, e.g., by a cDNA contained in the viral vector, are expressed 
efficiently in cells which have taken up viral vector nucleic add. 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be 
the recombmant gene deUvery system of choice for the transfer of exogenous genes in vivo, 
25 particularly into humans. These vectors provide efficient deUvery of genes into ceUs, and the 
transferred nucleic acids are stably integrated into the chromosomal DNA of tiie host A 
major prerequisite for the use of retroviruses is to ensure the safety of their use, particularly 
with regard to the possibiUty of the spread of wild-type virus in the ceU population. The 
development of specialized cell lines (termed "packaging cells") which produce only 
30 replication-defective retroviruses has increased the utility of retroviruses for gene therapy, 
and defective retroviruses are well characterized for use in gene transfer for gene therapy 
purposes (for a review see MiUer, A.D, (1990) Blood 76:271). Thus, recombinant retrovirus 
can be constructed in which part of the retroviral codmg sequence (gag, pol, env) has been 
replaced by nucleic add encoding one of Ae subject CCR-proteins, rendering the retrovirus 
35 replication defective. The replication defective retrovirus is then packaged into virions which 
can be used to infect a target cell through die use of a helper virus by standard techniques. 
Protocols for producing recombinant retroviruses and for infecting cells in vitro or in vivo 
with such viruses can be found in Current Protncols in Mnl^rn lar Rinlnpy^ Ausubel, F.M. et 
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al, (eds,) Greene Publishing Associates, (1989), Sections 9.10-9.14 and other standard 
laboratory manuals. Examples of suitable retroviruses include pU, pZIP, pWE and pEM 
which are well known to those skilled in the art. Examples of suitable packagmg virus lines 
for preparing both ecotropic and amphotropic retroviral systems include yCrip, yCre, \|/2 
5 and v|;Am. Retroviruses have been used to introduce a variety of genes into many different 
cell types, including neural cells, epithelial cells, endothelial cells, lymphocytes, myoblasts, 
hepatocytes, bone marrow cells, in vitro and/or in vivo (see for example Eglitis, et al. (1985) 
Science 230:1395-1398; Danos and Mulligan (1988) Proc. Natl Acad Sci. USA 85:6460- 
6464; Wilson et al. (1988) Proc, Natl. Acad Sci, USA 85:3014-3018; Armentano et al. 

10 (1990) Proc. Natl Acad Sci. USA 87:6141-6145; Huber et al. (1991) Proc, Natl Acad Set 
USA 88:8039-8043; Feny et al. (1991) P-oc, Natl Acad Set USA 88:8377-8381; 
Chowdhuiy et al. (1991) Science 254:1802-1805; van Beusechem et al. (1992) Proa Natl 
Acad Set USA 89:7640-7644; Kay et al. (T 992) Human Gene Therapy 3:641-647; Dai et al. 
(1992) Proc. Natl Acad ScL USA 89:10892-10895; Hwuetal. (1993),/. Immunol 150:4104- 

15 4115; U.S. Patent No. 4,868,116; U.S. Patent No. 4,980,286; PCT Application WO 
89/07136; PCT Application WO 89/02468; PCT Application WO 89/05345; and PCT 
Application WO 92/07573). 

In choosing retroviral vectors as a gene delivery system for the subject CCR-genes, it 
is important to note that a prerequisite for the successful infection of target cells by most 

20 retroviruses, and therefore of stable introduction of the recombinant CCR-gene, is that the 
target cells must be dividing. In general, this requirement will not be a hindrance to use of 
retroviral vectors to deliver CCR-gene constructs. In fact, such limitation on infection can be 
beneficial in circumstances wherein the tissue (e.g. nontransformed cells) surrounding die 
target cells does not undergo extensive cell division and is therefore refractory to infection 

25 with retroviral vectors. 

Furthermore, it has been shown ibat it is possible to limit the infection spectrum of 
retroviruses and consequently of retroviral-bascd vectors, by modifying the viral packaging 
proteins on the surface of the viral particle (see, for example PCT publications W093/25234, 
WO94/06920, and W094/11524). For instance, strategies for the modification of the 

30 infection spectrum of retroviral vectors include: coupling antibodies specific for cell surface 
antigens to the viral env protein (Roux et al. (1989) PNAS 86:9079-9083; Man et al. (1992) 
J. Gen Virol 73:3251-3255; and Goud et al. (1983) Virology 163:251-254); or coupling cell 
suifiice ligands to the viral env proteins (Neda et al. (1991) J Biol Chem 266:14143-14146). 
Coupling can be in the form of the chemical cross-linking with a protein or other variety (e.g. 

35 lactose to convert the env protein to an asialoglycoprotem), as well as by generating fusion 
proteins (e.g. single-chain antibody/c/jv fusion proteins). This technique, while useful to 
limit or otherwise direct the infection to certain tissue types, and can also be used to convert 
an ecotropic vector in to an amphotropic vector. 
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Moreover, use of retroviral goie deliveiy can be further enhanced by the use of tissue- 
or cell-specific transcriptional regulatory sequences vAdch control expression of the CCR- 
gene of the retroviral vector. 

Another viral gene delivery system useful in the present invention utilizes adenovirus- 
derived vectors. The genome of an adenovmis can be manipulated such that it encodes a 
gene product of interest, but is inactivate in terms of its ability to replicate in a normal lytic 
viral life cycle (see, for example, Berkner et al. (1988) BioTechniques 6:616; Rosenfeld et al. 

(1991) Science 252:431-434; and Rosenfeld et al. (1992) Cell 68:143-155). Suitable 
adenoviral vectors derived fi?om the adenovirus strain Ad type 5 dl324 or other strains of 
adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art Recombinant 
adenoviruses can be advantageous in certain circumstances in that they are not capable of 
infecting nondividing cells and can be used to infect a wide variety of cell types, including 
airway epithelium (Rosenfeld et al. (1992) cited st^ra), endotfaeUal ceUs (Lemarchand et al. 

(1992) Proc. Natl. Acad. Sci. USA 89:6482-6486), hepatocytes (Heiz and Gerard (1993) 
Proa. Natl. Acad Sci. USA 90:2812-2816) and muscle cells (Quantin et al. (1992) Proc. Natl 
Acad Sci. USA 89:2581-2584). Furthermofe, the virus particle is relatively stable and 
amenable to purification and concentration, and as above, can be modified so as to affect the 
spectrum of infectivity. Additionally, introduced adenoviral DNA (and foreign DNA 
contained therein) is not integrated into the genome of a host cell but remains episomal, 
thereby avoiding potential problems that can occur as a result of insertional mutagenesis in 
situations where introduced DNA becomes integrated into the host genome (e.g., retroviral 
DNA). Moreover, the carrying capacity of the adenoviral genome for foreign DNA is large 
(up to 8 kilobases) relative to otber gene delivery vectors (Berimer et al., jwpro; Haj-Ahmand 
and Graham (1986)7. Virol. 57:267). Most replication-defective adenoviral vectors cunenUy 
in use and therefore favored by the present inve.don are deleted for all or parts of the viral 
El and E3 genes but retain as much as 80% of the adenoviral genetic material (see, e.g., 
Jones et al. (1979) Cell 16:683; Berknei et al.. supra; and Graham et al. in Methods in 
Molecular Biology. E.J. Murray, Ed. (Humana, Clifton, NJ, 1991) vol. 7. pp. 109-127). 
Expression of the inserted CCR-gene can be under control of, for example, the ElA 
promoter, fhs major late promoter (ML?) and associated leader sequences, the E3 promoter, 
or exogenously added promoter sequences. 

Yet another vital vector system usefid for delivery of the subject CCR-gene is the 
adeno-associated virus (AAV). Adeno-associated virus is a naturally occurring defective 
virus that reqmres another virus, such as an adenovmis or a herpes virus, as a helper virus for 
efBcient replication and a productive life cycle. (For a review see Mu^czka et al. Curr. 
Topics in Micro, andlmmunol. (1992) 158:97-129). It is also one of the few viruses that may 
integrate its DNA into non-dividing cells, and exhibits a high frequency of stable integration 
(see for example Flotte et al. (1992) Am. J. Respir. Cell Mol. Biol. 7:349-356; Samulski et al. 
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(1989) J, Virol 63:3822-3828; and McLaughlin et al. (1989) 7. Virol 62:1963-1973). 
Vectors containing as little as 300 base pairs of AAV can be packaged and can integrate. 
Space for exogenous DNA is limited to about 4.5 kb. An AAV vector such as that described 
in Tratschin et al. (1985) Mol Cell Biol 5:3251-3260 can be used to introduce DNA into 
5 cells. A variety of nucleic acids have been introduced into different cell types using AAV 
vectors (see for example Hermonat et al. (1984) Proc, Natl Acad ScL USA 81:6466-6470; 
Tratschin et al. (1985) Mol Cell Biol 4:2072-2081; Wondisford et al. (1988) Mol 
Endocrinol 2:32-39; Tratschin et al. (1984) J, Virol 51:611-619; and Flotte et al. (1993) J. 
Biol ChenL 268:3781-3790). 
10 Other viral vector systems that may have application in gene therapy have been 

derived from herpes virus, vaccinia virus, and several RNA viruses. In particular, hexx)es 
virus vectors may provide a unique strategy for persistence of the recombinant CCR-gene in 
cells of the central nervous system and ocular tissue (Pepose et al. (1994) Invest Ophthahnol 
Vis Sci 35:2662-2666) 

15 In addition to viral transfer methods, such as those illustrated above, non-viral 

methods can also be employed to cause expression of a CCR-protein in the tissue of an 
animal. Most nonviral methods of gene transfer rely on normal mechanisms used by 
mammalian cells for the uptake and intracellular transport of macromolecules. In preferred 
embodiments, non-viral gene delivery systems of the present invention rely on endocytic 

20 pathways for the uptake of the subject CCR-gene by the targeted cell. Exemplary gene 
delivery sfystems of this type include liposomal derived systems, poly-lysine conjugates, and 
artificial viral envelopes. 

In a representative embodiment, a gene encoding one of the subject CCR-proteins can 
be entrapped m liposomes bearing positive charges on their surfiice (e.g., lipofectins) and 

25 (optionally) which are tagged with antibodies against cell surface antigens of the target tissue 
(Mizuno et al. (1992) No Shinkei Geka 20:547-55 1 ; PCT publication WO91/06309; Japanese 
patent application 1047381; and European patent publication EP-A-43075). For example, 
lipofection of neuroglioma cells can be carried out using liposomes tagged with monoclonal 
antibodies against glioma-associated antigen (Mizuno et al. (1992) Neurol Med, Chir. 

30 32:873-876). 

In yet another illustrative embodiment, the gene delivery system comprises an 
antibody or cell surface ligand which is cross-linked with a gene binding agent such as poly- 
lysine (see, for example, PCT publications WO93/04701, W092/22635, WO92/20316, 
W092/19749, and WO92/06180). For example, the subject CCR-gene construct can be used 
35 to transfect hepatoQrtic cells in vivo using a soluble polynucleotide carrier comprising an 
asialoglycoprotein conjugated to a polycation, e.g. poly-lysine (see U.S. Patent 5,166,320). It 
will also be appreciated that effective delivery of the subject nucleic acid constructs via - 
mediated endocytosis can be improved using agents which enhance escape of the gene from 
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the endosomal structures. For instance, whole adenovirus or fusogenic peptides of the 
influenza HA gene product can be used as part of the delivery system to induce efficient 
disruption of DNA-containing endosomes (Mulligan et al. (1993) Science 260-926; Wagner 
et al. (1992) PNAS 89:7934; and Christian© et al. (1993) PNAS 90:2122). 

In clinical settings, the gene delivery systems can be introduced into a patient by any 
of a number of methods, each of which is familiar in the art. For instance, a pharmaceutical 
preparation of the gene delivery system can be introduced systemically, e.g. by intravenous 
injection, and specific transduction of the in the target cells occurs predominantly from 
specificity of transfection provided by the gene delivery vehicle, cell-type or tissue-type 
expression due to the transcriptional regulatoiy sequences controlling expression of the gene, 
or a combination thereof. In other embodiments, initial delivery of the recombinant gene is 
more limited with introduction into the animal being quite localized- For example, the gene 
delivery vehicle can be introduced by catheter (see U.S. Patent 5,328,470) or by stereotactic 
injection (e.g. Chen et al. (1 994) PNAS 91 : 3054-3057). 

Moreover, the pharmaceutical preparation can consist essentially of the gene delivery 
system in an acceptable diluent, or can comprise a slow release matrix in which the gene 
delivery vehicle is imbedded. Alternatively, where the complete gene delivery system can be 
produced in tact from recombinant ceUs, e.g. retroviral packages, the pharmaceutical 
preparation can comprise one or more ceils which produce the gene delivery system. In the 
case of the latter, methods of introducmg the viral packaging cells may be provided by, for 
example, rechargeable or biodegradable devices. Various slow release polymeric devices 
have been developed and tested in vivo in recent years for the controlled delivery of drugs, 
including proteinacious biopharmaceuticals, and can be adapted for release of viral particles 
through the manipulation of the polymer composition and form. A variety of biocompatible 
polymers (including hydrogels), including both biodegradable and non-degradable polymers, 
can be used to form an implant for the sustained release of an the viral particles by cells 
implanted at a particular target site. Such embodiments of the present invention can be used 
for the delivery of an exogenously purified virus, wdiich has been incorporated in the 
polymeric device, or for the delivery of viral particles produced by a cell encapsulated in the 
polymeric device. 

By choice of monomer composition or polymerization technique, the amount of 
water, porosity and consequent permeability characteristics can be controlled. The selection 
of the shape, size, polymer, and mefliod for unplantation can be determined on an mdividual 
basis according to the disorder to be treated and tixe individual patient response. The 
generation of such implants is generally known in the art. See, for example, Concise 
Encyclopedia of Medical & Dental Materials^ ed. by David Williams (MIT Press: 
Cambridge, MA, 1990); and the Sabel et al. U.S. Patent No. 4,883,666. In another 
embodiment of an implant, a source of cells producing a tiie recombinant virus is 
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encapsulated in implantable hollow fibers. Such fibers can be pre-spun and subsequently 
loaded with the viral source (Aebischer et al. U.S. Patent No. 4,892,538; Aebischer et al. U.S. 
Patent No. 5,106,627; Hof&ian et al. (1990) Expt, Neurobiol 1 10:39-44; Jaeger et al. (1990) 
Prog. Brain Res, 82:41-46; and Aebischer et al. (1991) 7. Biomech Eng. 113:178-183), or 
5 can be <x>-extruded with a polymer which acts to form a polymeric coat about the viral 
packaging cells (Lim U.S. Patent No. 4,391,909; Sefton U.S. Patent No. 4,353,888; Sugamori 
et al. (1989) Trans, Am. Artif Intern, Organs 35:791-799; Sefton et al. (1987) Biotechnol 
Bioeng. 29:1135-1143; and Aebischer et al. (1991) Biomaterials 12:50-55). Again, 
manipulation of the polymer can be carried out to provide for optimal release of viral 
10 particles. 

To further illustrate the use of the subject method, the therapeutic application of a 
CCR-gene, e.g., by gene therapy, can be used in the treatment of a neuroglioma. Gliomas 
accoimt for 40-50% of intracranial tumors at all ages of life. Despite the increasing use of 
radiotherapy, chemotherapy, and sometimes immunotherapy after surgery for malignant 

15 glioma, the mortality and morbidity rates have not substantially improved. However, there is 
increasing experimental and clinical evidence that for a significant number of gliomas, loss 
of TGF-p responsiveness is an important event in the loss of growth control. Irrespective of 
the cause of decreased responsiveness, e.g. the loss of fimction of pi 5 or the loss of other 
TGF-p signal transduction proteins, exogenous expression of pi 5 (or other CCR-protein) in 

20 the cell can used effectively to inhibit cell proliferation. 

It has been demonstrated that gene therapy can be used to target glioma cells for 
expression of recombinant protems (Miyao et al. (1993) J. Neurosci Res. 36:472-479; Chen 
et al. (1994) PNAS 91:3054-3057; and Takamiya et al. (1993) 7. Neurosurg. 79:104-110). 
Thus, a gene construct for expressing a CCR-protein can be delivered to the tumor, 

25 preferably by stereotactic-dependent means. In preferred embodiments, the gene delivery 
system is a retroviral vector. Smce rapidly growing normal cells are rare in the adult CNS, 
glioma cells can be specifically transduced with a recombinant retrovirus. For example, the 
retroviral particle can be delivered into the tumor cavity through an Ommaya tube after 
surgexy, or alternatively, packaging fibroblasts encapsulated in retrievable immunoisolatory 

30 vehicles can be introduced into the tumor cavity. In order to increase the effectiveness and 
decrease the side effects of the retrovirus-mediated gene therapy, glioma-specific promoters 
can be used to regulate expression of the CCR-gene. For example, the promoter regions of 
glial fibrillary acidic protein (GFAP) and myelin basis protem (MBP) can operably linked to 
the recombinant gene m order to direct glial cell-specific expression of the recombinant gene 

35 construct. 

In another embodiment, gene therapy can be used in conjunction with the subject 
CCR-protein in the treatment of various carcinomas. Among the carcinomas, a strong 
correlation has been observed between TGF-p responsiveness and both increasing malignant 
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tumor grade and increasing proliferating cell nuclear antigen index (Coffery et al. (1988) 
Cancer Res 48:1596-1602; Masui et al. (1986) PNAS 83:2438-2442; and Knabble et al. 
(1987) Cell 48:417-428). In a representative embodiment, a gene therapy system comprising 
the subject CCR-gene in a retroviral vector is used to treat certain breast cancers. In 
5 preferred embodiments, expression of the subject CCR-protein- gene is controlled at least in 
part by a manmiary-specific promoter, a number of which are available (for review, see 
Hennighausen (1990) Protein Expression and Purification 1:3-8; and Giinzberg et al. (1992) 
Biochem J 2^3:625-632). 

In similar feshion, gene therapy protocols involving delivery of the subject CCR-gene 

10 can be used in the treatment of malignant melanoma, which also serves as a model for 
progressive TGF-p resistance in transfonnation. In preferred embodiments, gene therapy 
protocols for treatment of melanomas include, in addition to the deliveiy of the CCR-gene 
construct by retroviral delivery, the delivery of a pharmaceutical preparation of the gene by 
direct injection. For instance, U.S. patent no. 5,318,514 describes an applicator for the 

1 5 electroporation of genes into epidermal cells and can be used in accordance with the present 
invention. 

The subject CCR-gene constructs can be used in the treatment of hyperproliferative 
vascular disorders, e.g. smooth muscle hyperplasia (such as atherosclerosis) or restinosis, as 
well as other disorders characterized by fibrosis, e.g. rheumatoid arthritis, insulin dependent 

20 diabetes mellitus, glomerulonephritis, cirrhosis, and scleroderma, particularly proliferative 
disorders in which loss of TGF-p autocrine or paracrine signaling is implicated. 

For example, restinosis continues to limit the eflScacy of coronary angioplasty despite 
various mechanical and pharmaceutical interventions that have been employed. An important 
mechanism involved in normal control of intimal proliferation of smooth muscle cells 

25 appears to be the induction of autocrine and paracrine TGF-p inhibitory loops in the smooth 
muscle cells (Scott-Burden et al. (1994) Tex Heart Inst J 21:91-97; Graiger et al. (1993) 
Cardiovasc Res 27:2238-2247; and Grainger et al. (1993) Biochem J 294 A09'U2y Loss of 
sensitivity to TGF-p, or alternatively, the overriding of this inhibitory stunuliis such as by 
PDGF autostimulation, can be a contributory factor to abnormal smooth muscle proliferation 

30 in restinosis. It may therefore be possible to treat or prevent restinosis by the use of gene 
therapy wifli CCR-gene constructs of the present mvention. The CCR-gene construct can be 
delivered, for example, by percutaneous translummal gene transfer (Mazur et al. (1994) Tex 
Heart Inst J 21:104-111) using viral or liposomal delivery compositions. An exemplary 
adenovinis-mediated gene transfer technique and compositions for treatment of cardiac or 

35 vascular smooth muscle is provided in PCT publication WO 94/1 1 506. 

Transforming growth factor-p is also understood to play a significant role in local 
glomerular and interstitial sites in human kidney development and disease. Consequently, 
the subject method provides a method of treating or inhibiting glomerulopathies and other 
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renal proliferative disorders comprising the in vivo delivery and recombinant expression of 
the subject CCR-protein in kidney tissue. 

The subject method can also be used to treat retinoblastomas in which the 
retinoblastoma gene (RB) is not itself impaired, e.g. the efifective impairment of the RB 
5 checkpoint is the result of a failure to control CDK4 phosphorylation of RB. Thus, an 
exogenous CCR-gene can be expressed in a retinoblastoma cell, thereby causing inhibition of 
CDK4 activation and down-regulating RB phosphorylation. To illustrate, a recombinant 
retrovirus can be constructed to facilitate expression of pi 6 or pi 5, and infectivity of 
retinoblastoma cells enhanced by derivatizing the env protein with antibodies specific for 

10 retinoblastoma cells, e.g. antibodies to retinal S-antigen (Doroso et al. (1985) Invest 
Opthalmol Vis Sci 26:560-572; see also Liao et al. (1981) Eur J Immunol 11:450-454; and 
U.S. Patent No. 4,444,744). 

In yet another embodiment, the subject CCR-gene is delivered to a sarcoma, e.g. an 
osteosarcoma or Kaposi's sarcoma. In a representative embodiment, the gene is provided in a 

15 viral vector and delivered by way of a viral particle which has been derivatized with 
antibodies immunoselective for an osteosarcoma cell (see, for example, U.S. Patents 
4,564,517 and 4,444,744; and Singh et al. (1976) Cancer Res 36:4130-4136). 

In a still further embodiment, the subject CCR-gene is recombinantiy expressed in 
tissue ^ch is characterized by unwanted de-differentiation and which may also be 

20 undergoing imwanted apoptosis. For instance, many neurological disorders are associated 
with degeneration of discrete populations of neuronal elements. For example, Alzheimer's 
disease is associated with deficits in several neurotransmitter systems, both those that project 
to the neocortex and those that reside with the cortex. For instance, the nucleus basalis in 
patients with Alzheimer's disease were observed to have a profound (75%) loss of neurons 

25 compared to age-matched controls. Although Alzhehner's disease is by far the most common 
form of dementia, several other disorders can produce dementia. Many are age-related, 
occurring in far greater incidence in older people than in yoimger. Several of these are 
degenerative diseases characterized by the death of neurons in various parts of the central 
nervoiis system, especially the cerebral cortex. However, some forms of dementia are 

30 associated with degeneration of the thalmus or the white matter underlying the cerebral 
cortex. Here, the cognitive dysfunction results from the isolation of cortical areas by the 
degeneration of efiTerents and afferents. Huntington's disease involves the degeneration of 
intrastraital and cortical cholinergic neurons and GABAergic neurons. Pick's disease is a 
severe neuronal degeneration in the neocortex of the fix>ntal and anterior temporal lobes, 

35 sometimes accompanied by death of neurons in the striatunL Accordingly, the subject CCR- 
gene can be delivered to the effected tissue by gene therapy techniques. It is noted that 
numerous advances have been made in the construction of expression vectors, cellular and 
viral transgene carriers, and the characterization of target cells for neuronal gene therapy, and 
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can be readily adapted for delivery of the subject CCR-genes (see, for example, Suhr et al. 
(1993) Arch Neurol 50:1252-1268; Jiao et al. (1993) Nature 362:450-453; Friedmann (1992) 
Am Med 24:41 1-417; and Freese et al. (1991) Nuc Acid Res 19:7219-7223) 

In addition to degenerative-induced dementias, the subject gene therapy systems can 
be applied opportunely in the treatment of neurodegenerative disorders which have 
manifestations of tremors and involimtary movements. Parkinson's disease, for example, 
primarily affects subcortical structures and is characterized by degeneration of the 
nigrostriatal pathway, raphe nuclei, locus cereleus, and the motor nucleus of vagus. Ballism 
is typically associated with damage to the subtfaalmic nucleus, often due to acute vascular 
accident Also included are neurogenic and myopathic diseases which ultimately affect the 
somatic division of the peripheral nervous system and are manifest as neuromuscular 
disorders. Examples include chronic atrophies such as amyotrophic lateral sclerosis, 
Guillain-Barre syndrome and chronic peripheral neuropathy, as well as other diseases which 
can be manifest as progressive bulbar palsies or spinal muscular atrophies. Moreover, the use 
of CCR gene therapy constructs is amenable to the treatment of disorders of the cerebellum 
which result m hypotonia or ataxia, such as those lesions m the cerebellum which produce 
disorders in the limbs ipsilateral to the lesion. For instance, pi 6 or pi 5 gene constructs can 
used to treat a restricted form of cerebellar cortical degeneration involving the anterior lobes 
(vermis and leg areas) such as is conunon in alcoholic patients. 

Furthermore, the subject CCR-genes can also be used in the treatment of autonomic 
disorders of the peripheral nervous system, vAnch include disorders affecting the innervation 
of smooth muscle and endocrine tissue (such as glandular tissue). For instance, lecombmant 
pi 6 or pi 5 expression by gene therapy can be used to treat tachycardia or atrial cardiac 
anythmias which may arise from a degenerative condition of the nerves mnervatmg the 
striated muscle of the heart. 

Yet another aspect of the invention pertains to CDK4 and CDK6 mutants which are 
msensitive to the inhibitory control of one or more of the subject CCR protems. With respect 
to cell cycle control, these mutants antagonize the action of CCR-proteins, and in preferred 
embodiments, are dominant negative mutants in that they remove from the control of mitosis 
certain cell-cycle checlqwints, such as checlq)oints mediated by RB or RB-like proteins. 
Exemplary CDK4 and CDK6 mutants are represented in SEQ ID Nos. 9 (CDK4) and SEQ ID 
Nos. 10 (CDK6), wherein Xaa at position 24 and 31 respectively, which is an Arg in the 
wild-type enayme, is mutated to render the enzyme insensitive to inhibition by CCR proteins. 
In preferred embodiments, Xaa is an amino acid other than arginine, though more preferably 
an amino acid which does not have a basic sidechain (e.g. not Arg, Lys or His). In a 
preferred embodiment, the Arginine is changed to a Cysteine. 

Other CDK4 mutants which have lost the ability to bind pl6 are described in 
Example 10 below. Accordingly, the Xaa at position 22 of CDK4 (residue 29 of CDK6) can 
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be an amino acid other than lysine, preferably an anuno acid with a basic sidechain or a non- 
polar sidechain; the Xaa at position 95 of CDK4 (residue 100 of CDK6) can be an amino acid 
other than a histidine, and is preferably an amino acid with an acidic or non-polar or 
hydrophobic sidechain; and the Xaa position 97 (residue 102 of CDK6) can be an amino acid 
5 other than and aspartic acid, and is preferably an amino acid with a basic sidechain or a non- 
polar sidechain. It will be appreciated that such CDK4/CDK6 mutants can comprise one or 
more of any of the above mutations. 

The mutant CDK4 and CDK6 proteins, genes encoding such proteins, and antibodies 
specific for such proteins, can be provided in a manner analogous to any of the embodiments 

10 described above for each of the CCR-proteins. For instance, nucleic acids encoding the 
mutant kinases can be provided in any of a number of forms, e.g, as recombinant expression 
vectors as well as probes. Both isolated and recombinant forms of the eirgnmes are 
specifically contemplated, as well as antibodies which q)ecifically bind the mutant fonns of 
each of the kinases. Moreover, it will be appreciated that, wherever the present application 

1 5 mentions the utility for CCR antagonists, the subject CDK mutants can be utilized. 

Moreover, while the Arg->Cys mutant of CDK4 was first discovered in cells from a 
melanoma patient, and the subsequent CDK4/CDK6 mutants isolated from analysis of 
molecular models of CDK4 based on this mutation (see Example 10), mutagenic techniques 
such as those described above can be utilized to isolate yet other CDK4 and CDK6 mutants 

20 vMch are resistant to CCR protein inhibition. For instance, scanning mutagenesis or random 
mutagenesis can be used to generate a library of CDK mutants which can be screened for 
dominant negative mutations employing assays sensitive to CDK4 or CDK6 inhibition. For 
example, a.library of CDK4 mutants can be transfected into cells which overexpress pi 6 and 
which are quiescent as a result. In the absence of mutations which uncouple the CDK4 

25 mutant from pl6/pl5, the cell will remain quiescent. In contrast, mutations to the kinase 
which uncouple its activity from pl6 will result in proliferation of cells expressing that 
mutant of CDK4. In a similar embodiment, the level of phosphorylated RB can be used to 
detect dominant negative mutants of CDK4, such as by inmiunodetection of phosphorylated 
RB, or the use of RB-sensitive reporter constructs the expression of which is dependent on 

30 the phosphorylation slate of RB (e.g., see Park et al. (1994) J Biol Chem 269:6083-6088; 
Ouellette et al. (1992) Oncogene 7:1075-1081; and Slack et al. (1993) Oncogene 8:1585- 
•1591). Mutants which uncouple pl6/pl5 inhibition of CDK4 will cause hi^er levels of 
phosphorylated RB protein to accumulate in the cell. 

Another aspect of the invention relates to the discovery of the CCR-binding pocket of 

35 CDK4/CDK6. As described in Example 1 0 below, we have ascertained fix)m point mutations 
and molecular modelling that a surface determined at least in part by residues K22, R24, H95 
and D97 of CDK4 (K29, R31, HlOO and D102 of CDK6) defines a binding pocket for CCR 
protems such as pi 6. This binding pocket, which is within the small lobe of CDK4, is in 
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close proximity to the ATP binding site for the kinase, the binding to which by pl6 likely 
blocks or distorts the ATP binding site of CDK4/CDK6. Accordingly, the present invention 
concerns three-dimensional molecular models of CDK proteins, and their use as templates for 
the design of agents able to inhibit at least one biological activity of that kinase. An integral 
5 step to an approach to designing CCR-like inhibitors of CDK4 or CDK6 involves 
construction of computer graphics models of the kinase which can be used to design 
pharmacophores by rational drug design based on mimicking the binding of CCR-proteins to 
the pl6-binding site. For instance, for an inhibitor to interact optimally with the kinase, it 
will generally be desirable that it have a shape which is at least partly complimentary to that 

10 of the pl6-binding site of the enzyme. Additionally, other factors, including electrostatic 
interactions, hydrogen bonding, hydrophobic interactions, desolvation effects, and 
cooperative motions of ligand and enzyone, all influence the binding effect and should be 
taken into account in attempts to design bioactive inhibitors. 

A computer-generated molecular model of the subject enzyme can be created. In 

15 preferred embodiments, at least the Ca-carbon positions of CDK4 or CDK6 are mapped to a 
particular coordmate pattern, such as the coordinates of CDK2 (DeBondt et al. (1993) Nature 
363:595-602; Endicott et al. (1994) Prot Eng 7:243-253; and Morgan et al. (1994) Curr Opin 
Cell Biol 6:239-246), by homology modeling, and the structure of the kinase and velocities of 
each atom are calculated at a simulation temperature (Tq) at which the docking simulation is 

20 to be determined. Typically, such a protocol involves primarily the prediction of side-chain 
conformations in the modeled kinase, while assuming a main-chain trace taken fix>m a-tertiaiy 
structure such as provided by the CDK2 structure. Computer programs for perfoiming 
energy minimization routines are commonly used to generate molecular models. For 
example, both the CHARMM (Brooks et al. (1983) J Cowpi// Chem 4:187-217) and AMBER 

25 . (Weiner et al (1981) J. CompuL Chem. 106: 765) algorithms handle all of the molecular 
system setup, force field calculation, and analysis (see also, Eisenfield et al. (1991) Am J 
Physiol 261:C376-386; Lybrand (1991) J Pharm Belg 46:49-54; Frounowitz (1990) 
Biotechniques 8:640-644; Burbam et al. (1990) Proteins 7:99-1 11; Pedersen (1985) Environ 
Health Perspect 61:185-190; and Kini et al. (1991) JBiomol Struct Dyn 9:475-488). At the 

30 heart of these programs is a set of subroutines that, given the position of every atom in the 
model, calculate the total potential energy of the system and the force on each atom. These 
programs may utilize a starting set of atomic coordinates, such as the model coordinates for 
CDK2, the parameters for the various terms of the potential energy function, and a 
description of die molecular topology (the covalent structure). Conunon features of such 

35 molecular modeling methods include: provisions for handling hydrogen bonds and other 
constraint forces; the use of periodic boimdary conditions; and provisions for occasionally 
adjusting positions, velocities, or other parameters in order to maintain or change 
temperature, pressure, volume, forces of constraint, or other externally controlled conditions. 
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Most conventional energy minimization methods use the input data described above 
and the &ct that the potential energy function is an explicit, differentiable function of 
Cartesian coordinates, to calculate the potential energy and its gradient (which gives the force 
on each atom) for any set of atomic positions. This information can be used to generate a 
5 new set of coordinates in an effort to reduce the total potential energy and, by repeating this 
process over and over, to optimize the molecular structure xmder a given set of external 
conditions. These energy minimization methods are routinely applied to molecules similar to 
CDK4 and CDK6 as well as nucleic acids, polymers and zeolites. 

In general, energy minimization methods can be carried out for a given temperature, Tj, 

10 which may be dififeient than the docking simulation temperature, Tq. Upon energy 
minimization of the molecule at Ti, coordinates and velocities of all the atoms in the system 
are computed. Additionally, the nonnal modes of the system are calculated. It will be 
appreciated by tfiose skilled in the art that erxh normal mode is a collective, periodic motion, 
with all parts of the system moving in phase with each other, and that tiie motion of the 

IS molecule is the superposition of all normal modes. For a given temperature, the mean square 
amplitude of motion in a particular mode is inversely proportional to the effective force 
constant for that mode, so that the motion of the molecule will often be dominated by the low 
frequency vibrations. 

After the molecular model has been energy minimized at Tj, the system is "heated" or 

20 "cooled" to the simulation temperature. To, by carrying out an equilibration run where the 
velocities of the atoms are scaled in a step-wise manner until the desired temperature, Tq, is 
reached. The system is further equilibrated for a specified period of time until certain 
properties of the system, such as average kinetic energy, remain constant. The coordinates 
and velocities of each atom are then obtained from the equilibrated system. 

25 Further energy minimization routines can dlso be carried out For example, a second 

class of methods involves calculating approximate solutions to the constrained EOM for the 
protein. These methods use an iterative approach to solve for the Lagrange multipliers and, 
typically, only need a few iterations if the corrections required are small. The most popular 
method of this type, SHAKE (Ryckaert et al. (1977) J Comput Phys 23:327; and Van 

30 Ounsteren et al. (1977) Mol Phys 34:131 1) is easy to implement and scales as 0(N) as the 
number of constraints increases. Therefore, the method is applicable to macromolecules such 
as the CDK proteins of interest. An alternative method, RATTLE (Anderson (1983) J 
Comput Phys 52:24) is based on the velocity version of the Verlet algorithm. Like SHAKE, 
RATTLE is an iterative algorithm and can be used to energy minimize the model of the 

35 subject protein. 

The increasing availability of biomacromolecule structures of potential 
pharmacophoric molecules that have been solved crystallographically has prompted the 
development of a variety of direct computational methods for molecular design, in which the 
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steric and electronic properties of substrate binding sites are use to guide the design of 
potential inhibitors of CDK4 and/or CDK6 (Cohen et al. (1990) 1 Med, Cam. 33: 883-894; 
Kuntz et al. (1982) J. Mol Biol 161: 269-288; DesJarlais (1988) J. Med Cam, 31: 722-729; 
Bartlett et al. (1989) {Spec, Publ, Roy. Soc, Chem.) 78: 182-196; Goodford et al. (1985) J, 
5 Med Cam, 28: 849-857; DesJarlais et al. J, Med Cam, 29: 2149-2153). Directed methods 
generally fall mto two categories: (1) design by analogy in which 3-D structures of known 
molecules (such as from a crystallographic database) are docked to the enzyme structure and 
scored for goodness-of-fit; and (2) de novo design, in which the ligand model is constructed 
piece-wise in the enzyme. The latter approach, in particular, can facilitate the development of 

1 0 novel molecules, uniquely designed to bind to the kinase CCR-binding site. 

In an illustrative embodunent, the design of potential CDK4/CDK6 inhibitors begins 
from the general perspective of shape complimentary for the pl6-binding sitiei, and a search 
algorithm is employed which is capable of scanning a database of small molecules of known 
three-dunensional structure for candidates which fit geometrically into the target binding site. 

15 It is not expected that the molecules found in the shape search will necessarily be leads 
themselves, since no evaluation of chemical interaction necessarily be made during the initial 
search. Rather, it is anticipated that such candidates might act as the framework for further 
design, providing molecular skeletons to which appropriate atomic replacements can be 
made. Of course, the chemical complimentary of these molecules can be evaluated, but it is 

20 expected that atom types will be changed to maximize the electrostatic, hydrogen bonding, 
and hydrophobic interactions with the em^e. Most algorithms of this type provide a 
method for finding a wide assortment of chemical structures that aro complementary to the 
shape of the pl6-bmding site of the subject kinases. Each of a set of small molecules from a 
particular data-base, such as the Cambridge Crystallographic Data Bank (CCDB) (Allen et al. 

25 (1973) J, Chem Doc, 13: 1 19), is individually docked to the pl6-binding site of the kinase in 
a number of geometrically permissible orientations with use of a docking algorithm. In a 
preferred embodiment, a set of computer algorithms called DOCK, can be used to 
characterize the shape of invaginations and grooves that form the pl6 recognition surfaces of 
the CDK protein (Kuntz et al. (1982) J. Mol Biol 161: 269-288). The program can also 

30 search a database of small molecules for templates whose shapes are complementary to the 
pl6-bmding sites of the enzyme (DesJarlais etal. (1988)7 A/crfCtew 31: 722-729). These 
' templates normally require modification to achieve good chemical and electrostatic 
interactions (DesJarlais et al. (1989) ACS Symp Ser 413: 60-69). However, the program has 
been shown to position accurately known cofactors for inhibitors based on shape constraints 

35 alone. 

The orientations are evaluated for goodness-of-fit and the best are kept for further 
examination usmg molecular mechanics programs, such as AMBER or CHARMM. Such 
algorithms have previously proven successful in finding a variety of molecules that are 
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complementary in shape to a given binding site of a receptor-enzyme, and have been shown 
to have several attractive features. First, such algorithms can retrieve a remarkable diversity 
of molecular architectures. Second, the best structures have, in previous applications to other 
proteins, demonstrated impressive shape complementarity over an extended surface area. 
5 Third, the overall approach appears to be quite robust with respect to small uncertainties in 
positioning of the candidate atoms. 

Goodford (1985, J Med Chem 28:849-857) and Boobbyer et al. (1989, J Med Chem 
32: 1 083- 1 094) have produced a computer program (GRID) which seeks to determine regions 
of high affmity for different chemical groups (termed probes) on the molecular surface of the 

10 binding site. GRID hence provides a tool for suggesting modifications to known ligands, 
such as pi 6 and the like, that might enhance binding. It may be anticipated that some of the 
sites discerned by GRID as regions of high afBnity correspond to '*pharmacophoric patterns" 
determined inferentially from a series of known ligands. As used herein, a pharmacophoric 
pattern is a geometric arrangement of features of the anticipated ligand that is believed to be 

15 important for binding. Accordingly, short overlapping peptidyl portions of the subject CCR- 
proteins can be scanned for ability to bind the pl6-binding site defined for CDK4 or CDK6. 

Yet a further embodiment of the present invention utilizes a computer algorithm such 
as CLIX which searches such databases as CCDB for small molecules which can be oriented 
in the receptor binding site in a way that is both sterically acceptable and has a high 

20 likelihood of achieving favorable chemical interactions between the candidate molecule and 
the surrounding amino acid residues. The method is based on characterizing the pl6-binding 
site of the kinase in terms of an ensemble of favorable binding positions for different 
chemical groups and tiien searching for orientations of the candidate molecules that cause 
maximimi spatial coincidence of individual candidate chemical groups with members of the 

25 ensemble. The current availability of computer power dictates that a computer-based search 
for novel ligands follows a breadth-first strategy. A breadth-first strategy auns to reduce 
progressively the size of the potential candidate search space by the application of 
increasingly stringent criteria, as opposed to a depth-first strategy wherein a maxunally 
detailed analysis of one candidate is performed before proceeding to the next. CLIX 

30 conforms to this strategy in that its analysis of binding is rudunentary -it seeks to satisfy the 
necessary conditions of steric fit and of having individual groups in "correct" places for 
•bonding, without imposing flie sufficient condition that favorable bonding interactions 
actually occur, A ranked "shortlist" of molecules, in their favored orientations, is produced 
which can then be examined on a molecule-by-molecule basis, using computer graphics and 

35 more sophisticated molecular modeling techniques. CLIX is also capable of suggesting 
changes to the substituent chemical groups of the candidate molecules that might enhance 
binding to the pl6-binding pocket of the kinase. 
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The algorithmic details of CLIX is described in Lawerence et al. (1992) Proteins 
12:31-41, and the CLIX algorithm can be summarized as follows. The GRID program is 
used to determine discrete favorable interaction positions (termed target sites) in the pie- 
binding site of the kinase for a vsdde variety of representative chemical groups. For each 
candidate ligand in the CCDB an exhaustive attempt is made to make coincident, in a spatial 
sense in the binding site of the protein, a pair of the candidate's substituent chemical groups 
with a pak of corresponding favorable interaction sites proposed by GRID. All possible 
combinations of pairs of ligand groups with pairs of GRID sites are considered during this 
procedure. Upon locating such coincidence, the program rotates the candidate ligand about 
the two pairs of groups and checks for steric hindrance and comcidence of other candidate 
atomic groups with appropriate target sites. Particular candidate/orientation combinations 
that are good geometric fits in the binding site and show sufficient coincidence of atomic 
groups with GRID sites are retained. 

Consistent with the breadth-first strategy, this approach involves simplifying 
assumptions. Rigid protem and small molecule geometry is maintained throughout. As a 
first approximation, rigid geometry is acceptable as the energy mmimized coordinates of the 
CDK4 or CDK6 deduced structure describe an energy minimum for the molecule, albeit a 
local one. A further assumption implicit in CLDC is that the potential ligand, when 
mtroduced into the pl6-binding site of the kinase, does not induce change in the protein's 
stereochemistiy or partial charge distribution and so alter the basis on which the GRID 
interaction energy m^s were computed. It must also be stressed that the interaction sites 
predicted by GRID are used in a positional and type sense only, i.e., when a candidate atomic 
group is placed at a site predicted as favorable by GRID, no check is made to ensure that the 
bond geometry, the state of protonation, or the partial charge distribution favors a strong 
interaction between the protein and that group. Such detailed analysis should fomi part of 
more advanced modeling of candidates identified in the CLK shortlist. 

Yet another embodiment of a computer-assisted molecular design method for 
identifying specific inhibitors of CDK4 or CDK6 comprises the de novo synthesis of 
potential inhibitors by algorithmic connection of small molecular firagments that will exhibit 
the desired structural and electrostatic complementarity with the pl6-bindmg site of the 
enzyme. The methodology employs a large template set of small molecules with are 
•iteratively pieced together in a model of the CDK4/CDK6 pl6-binding site. Each stage of 
ligand growdi is evaluated according to a molecular mechanics-based energy fimction, which 
considers van der Waals and coulombic interactions, internal strain energy of the lengthening 
ligand, and. desolvation of both ligand and enzyme. The search space can be managed by use 
of a data tree which is kept under control by pruning according to the binding criteria. 

In an illustrative embodiment, the search space is limited to consider only amino acids 
and amino acid analogs as the molecular building blocks. Such a methodology generally 
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employs a large template set of amino acid conformations, though need not be restricted to 
just the 20 natural amino acids, as it can easily be extended to include other related fiagments 
of interest to the medicinal chemist, e.g. amino acid analogs. The putative ligands that result 
from this construction method are peptides and peptide-like compounds rather than the small 
5 organic .molecules that are typically the goal of drug design research^ The appeal of the 
peptide building approach is not that peptides are preferable to organics as potential 
pharmaceutical agents, but rather that: (1) they can be generated relatively rapidly de novo; 
(2) their energetics can be studied by well-parameterized force field methods; (3) they are 
much easier to synthesize than are most organics; and (4) they can be used in a variety of 

10 ways, for peptidomimetic inhibitor design, protein-protein binding studies, and even as shape 
templates in the more commonly used 3D organic database search approach described above. 
Moreover, peptidyl fragments such as may be vutually derived from the CCR-protein 
sequences described herein, can be employed. 

Such a de novo peptide design method has been incorporated in a software package 

15 called GROW (Moon et al. (1991) Proteins 11:314-328). In a typical design session, 
standard interactive graphical modeling methods are employed to define the structural 
environment in vAdch GROW is to operate. For instance, the environment could be the pie- 
binding site of CDK4, or it could be a set of features on the protein's surface to which the user 
wishes to bind a peptide-like molecule, e.g. residues K22, R24, H95 and D97 of CDK4. The 

20 GROW program then operates to generate a set of potential ligand molecules. Interactive 
modeling methods then come into play again, for examination of the resulting molecules, and 
for selection of one or more of them for fiirther refinement. 

To illustrate, GROW operates on an atomic coordinate file generated by the user in 
the interactive modelmg session, such as the coordinates provided by homology modelling of 

25 CDK4, or the simply the coordinates of the pl6-binding site, plus a small fragment (e.g., an 
acetyl group) positioned in the pl6-binding site to provide a starting point for peptide growth. 
These are referred to as "site" atoms and "seed" atoms, respectively. A second file provided 
by the user contains a number of control parameters to guide the peptide growth (Moon et al. 
(1991) Pro/e//ur 11:314-328). 

30 The operation of the GROW algorithm is conceptually fairly simple. GROW 

proceeds in an iterative fashion, to systematically attach to the seed firagment each amino acid 
template in a large preconstnicted library of amino acid conformations. When a template has 
been attached, it is scored for goodness-of-fit to the receptor site, and then the next template 
in the library is attached to the seed. After all the templates have been tested, only the highest 

35 scoring ones are retained for the next level of growth. This procedure is repeated for the 
second growth level; each library template is attached in turn to each of the bonded 
seed/amino acid molecules that were retamed from the first step, and is then scored. Again, 
only the best of the bonded seed/dipeptide molecules that result are retained for the third level 
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of growth. The growth of peptides can proceed m the N-to-C direction only, the reverse 
direction only, or in alternating directions, depending on the initial control deifications 
supplied by the user. Successive growth levels therefore generate peptides that are 
lengthened by one residue. The procedure terminates when the user-defined peptide length 
5 has been reached, at which point the user can select from the constructed peptides those to be 
studied fiirther. The resulting data provided by the GROW procedure include not only 
residue sequences and scores, but also atomic coordinates of the peptides, related directly to 
the coordinate system of the receptor site atoms. 

In yet another embodiment, potential pharmacophoric compounds can be determined 

10 using a method based on an energy minimization-quenched molecular dynamics algorithm 
for determining energetically favorable positions of fimctional groups in the pl6-binding sie 
of a CDK. The method can aid in the design of molecules that incorporate such fimctional 
groups by modification of known ligands or de novo construction. 

For example, the multiple copy sunultaneous search method (MCSS) described by 

15 Miranker et al. (1991) Proteins 1 1 : 29-34 can be employed. To detennine and characterize a 
local minima of a fimctional group in the forcefield of the protein, multiple copies of selected 
fimctional groups are first distributed in the pl6-binding site of the CDK protein of interest.' 
Energy minimization of these copies by molecular mechanics or quenched dynamics yields 
the distinct local nunima. The neighborhood of these minima can then be explored by a grid 

20 search or by constrained minimization. In one embodiment, the MCSS method uses the 
classical time dependent Hartee (TDH) approximation to simultaneously minimize or quench 
many identical groups in the forcefield of the protein. 

Implementation of the MCSS algorithm requires a choice of fimctional groups and a 
molecular mechanics model for each of them. Groups must be simple enough to be easily 

25 characterized and manipulated (3-6 atoms, few or no dihedral degrees of freedom), yet 
complex enough to approxunate the steric and electrostatic mteractions that the fimctional 
group would have in binding to the pl6-bmding site of the kinase. A preferred set is, for 
example, one in which most organic molecules can be described as a collection of such 
groups {PataVs Guide to the Chemistry of Functional Groups, ed. S. Patai (New York: John 

30 Wiley, and Sons, (1989)). This includes fiagments such as acetonitrile, methanol, acetate, 
methyl ammonium, dimethyl ether, methane, and acetaldehyde. 

Determination of Ae local energy minima in the bmding site reqixires that many 
starting positions be sampled. This can be achieved by distributmg, for example, 1,000-5,000 
groups at random inside a sphere centered on the binding site; only the space not occupied by 

35 the protein needs to be considered. If the interaction energy of a particular group at a certain 
location with the protem is more positive than a given cut-off (e.g. 5.0 kcal/mole) the group is 
discarded from that site. Given the set of starting positions, all the fragments are minimized 
simultaneously by use of the TDH approximation (Elber et al. (1990) J Am Chem Soc 1 12: 
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9 161 -91 75). In this method, the forces on each fragment consist of its internal forces and 
those due to the protein. The essential element of this method is that the interactions between 
the fragments are omitted and the forces on the protein are normalized to those due to a single 
fragment. In this way simultaneous minimization or dynamics of any number of functional 
5 groups in the field of a single protein can be performed. 

Minimization is performed successively on subsets of, e.g. 100, of the randomly 
placed groups. After a certain number of step intervals, such as 1,000 intervals, the results 
can be examined to eliminate groups converging to the same minimum. This process is 
repeated until minimization is complete (e.g. RMS gradient of 0.01 kcal/mole/A). Thus the 

1 0 resulting energy minimized set of molecules comprises what amounts to a set of disconnected 
fragments in three dimensions representing potential pharmacophores. 

The next step then is to connect the phannacophoric pieces with spacers assembled 
frx)m small chemical entities (atoms, chains, or ring moieties). In a preferred embodiment, 
each of the disconnected can be linked in space to generate a single molecule using such 

15 computer programs as, for example, NEWLEAD (Tschinke et al. (1993) J Med Chem 36: 
3863,3870). The procedure adopted by NEWLEAD executes the following sequence of 
commands (1) connect two isolated moieties, (2) retain the intermediate solutions for further 
processing, (3) repeat the above steps for each of the intermediate solutions untU no 
disconnected units are found, and (4) output the final solutions, each of which is single 

20 molecule. Such a program can use for example, three types of spacers: library spacers, 
single-atom spacers, and fuse-ring spacers. The library spacers are optimized structures of 
small molecules such as ethylene, benzene and methylamide. The ou^ut produced by 
programs such as NEWLEAD consist of a set of molecules containing the original fragments 
now connected by spacers. The atoms belonging to the input fragments maintain their 

25 original orientations in space. The molecules are chemically plausible because of the simple 
makeup of the spacers and functional groups, and energetically acceptable because of the 
rejection of solutions with van-der Waals radii violations. 

Still another aspect of the mvention relates to CDK mutants which have been 
rendered sensitive to CCR proteins such as pl6 or pl5. For instance, CDK2 or CDC2, which 

30 do not normally bind pi 6, and be rendered sensitive to pi 6 inhibition by replacing certain 
residues of the kinase with the corresponding residues from CDK4 \^ch are involved in pi 6 
-binding. For example, as Figure 7 illustrates, portions of CDK4 corresponding to pi, P2, p3, 
p4, P5, or L7 can be interchanged into CDK2 or CDC2 in order to yield a pl6-sensitive 
CDK. Such mutants can be used, for example, to generate recombinant cells eifter in vitro or 

35 in vivo vMch can be inhibited at other points of the cell-cycle by the expression of a CCR- 
protein. 
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The invention now being generally described, it will be more readily understood by 
reference to the following examples which are mcluded merely for purposes of illustration of 
certain aspects and embodiments of the present invention, and are not intended to limit the 
5 invention. 

As described previously (see U.S. Patent Applications Serial Nos. 08/154,915, 
07/991,997 and 07/963,308, as well as Xiong et al. (1993) Nature 366:701; Xiong et al. 
(1993) Genes Dev 7:1572; Xiong et al. (1992) Cell 71:505; and Zhang et al. (1993) Mol Cell 
Biol 4:897), immunological procedures have been used to establish that cyclins associate, in 

10 eukaryotic cells, with a variety of potential catalytic subunits (e.g., CDKs, such as CDK2, 
CDK4 and CDKS). To illustrate, human cyclin Dl has been associated with a wide variety of 
proliferative diseases. In human diploid cells, specifically human diploid fibroblasts, cyclin 
Dl is complexed with a number of other cellular proteins. Among them are the catalytic 
subunits CDK2, CDK4 (previously called PSK-J3), CDKS (also called PSSALRE), and 

15 CDK6 (PLSTIRE). In addition, polypeptides of 21 kDa and 36 kDa were identified in 
association with cyclin Dl. It was shown that the 36 kDa protein is the Proliferating Cell 
Nuclear Antigen, PCNA. PCNA has been described as an essential accessory factor to the 
delta polymerase, which is reqxiired for leading-strand DNA replication and DNA repair. 
Cyclin D3 was also found to associate with multiple protein kinases, p21 and PCNA. It was 

20 therefore proposed that there exists a quaternary complex of D type cyclins, CDK, PCNA and 
p21, and that many combinatorial variations (qrclin Dl, D3 with CDK2, 4, 5 and 6) may 
assemble in vivo. 

The importance of the quaternary complex is emphasized by the discovery that 
celliilar transformation by DNA tumor viruses is associated with selective subunit 

25 rearrangement of the cyclin D complexes, as well as other cell-c^cle complexes, including 
cyclin A, CDC2, CDK2, CDK4 and CDKS complexes. In particular, introduction of SV40 
DNA tumor virus or its oncogenic gene product large T antigen into normal human diploid 
fibroblasts (HDF) causes disruption of the association between cyclin D and PCNA, CDKs 
(such as CDK2, CDK4, CDKS and CDK6) and p21. For example, after dissociation fi-om 

30 cyclin D and p21, CDK4 kinase becomes associated with a 16 kDa polypeptide (pl6). 
Similarly, SV40 transformation causes a decrease of association of p21 with cyclin A in 
•HDF; and adenovmis-(293 cell line) or human papilloma virus- (HeLa cell line) transformed 
cells, p21 is completely disassociated fix)m cyclin A. A 19 kDa peptide, pl9, tiien appears in 
a complex with cyclin A. 

35 Thus, m many transformed cells, cyclin and CDKs associate in binary complexes 

which form the core of the cell-cycle regulatory machinery. In normal cells, a major fraction 
of the cyclin kinases acquire two additional subunits (p21 and PCNA) and thereby form 
quaternary complexes. Reconstitution of quaternary complexes in insect cells revealed that 
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p2I is a universal inhibitor of cyclin kinases. As such, p21 inhibits cell-cycle progression 
and cell proliferation vpon overe7q}ression in mammalian cells. Taken in conjunction with 
the previously demonstrated absence of p21 protein in the cell-cycle kinase complexes of 
cells with deficient p53, these results suggest that p21 is a transcriptional target of the tumor 
5 suppressor protein, p53. One function of p53 is to act in a cellular signaling pathway which 
causes cell-cycle arrest following DNA damage (see for example, Kastan et al. Cell 71 :587- 
5971993). It has therefore been suggested that p21 fonns a critical link between p53 and the 
cell-cycle control machinery. 

Cyclin D/CDK4 kinase differs fit>m the others in its inability to utilize histone HI as a 

10 substrate. To date, the only substrates known for cyclin D/CDK4 kinases are the members of 
the RB family of "pocket" proteins (Matsushime et al.. Cell 71:323-334 (1992)). Therefore, 
the effect of p21 was tested on the ability of cyclin D/CDK4 to phosphorylated RB. Insect 
cell lysates containing cyclin D or CDK4 alone showed little activity toward GST-RB. 
However, cyclin D/CDK4 binary complexes catalyzed substantial RB phosphorylation. 

15 Addition of increasing amoimts of p21 resulted m the accumulation of cyclin D/CDK4/p21 
ternary complexes with a corresponding inhibition of RB phosphorylation. Inclusion of 
PCNA was essentially without effect. However, cells lacking functional p53 may 
nevertheless retain a functional RB checkpoint which undergoes differential phosphorylation 
despite lack of endogenous p21 . 

20 The two-hybrid screening system (Fields et al. Nature 340:254 (1989)) was utilized to 

search for proteins that could interact with human CDK4, and more specifically, to isolate a 
cDNA encoding pl6. Two-hybrid screening relies on reconstitutmg a functional GAL4 
activator from two separated fusion proteins: the GAL4 DNA-binding domain fiised to 
CDK4, GAL4db-CDK4; and the GAL4 activation domain fused to Ae proteins encoded by 

25 HeLa cDNAs, GAL4ad-cDNA. YPB2 was used as the recipient yeast as it is a strain that 
contains two chromosomal genes under the control of two different GAL4-dependent 
promoters: HIS3 and LacZ. YPB2 was transformed with a mixture of two plasmids encoding, 
respectively, the GAL4db-CDK4 and the GAL4ad-cDNA fusions; several clones were 
obtained that grew in the absence of histidine and that turned blue in the presence of p-gal. 

30 From DNA sequencing data it was determined fliat each of the positive clones derived from 
the same gene, although one group represented mRNAs with a shorter 3' end. The sequence 
of these cDNAs contained, in-phase with the GAL4ad, an open reading frame encodmg a 
protein of 148 amino acids with a predicted molecular weight of 15,845 daltons (see SEQ ID 
Nos. 1 and 2). The sequence of pi 6 was compared by standard methods with those present in 

35 the currently available data banks and no significant homologies were foimd. 

To test if pl6 would specifically bind CDK4, YPB2 were cotransformed with tiie 
GAL4ad-pl6 fusion as well as with several target GAL4db fusion constructs containing, 
respectively, cdc2, CDK2, CDK4, CDK5, PCNA and Snfl (a fission yeast kinase). 
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Transfonned cells were plated with and without histidirie. Only the GAL4db-CDK4 fusion 
interacted with GAL4ad-pl6 to an extent which allowed growth in the absence of histidine, 
indicating that this pair of fusion protems specifically reconstituted a functional GAL4 
activator able to enhance the expression of the HISS gene. The same result was obtained 
5 when the ability to transactivate the expression of the p-galactosidase gene was assayed. 

The specificity of this interaction was further demonstrated in a cell-free system, by 
mixing in vitro translated (35s)-labeled CDKs with a purified bacterially-produced fusion 
protein consisting of glutathione-S transferase (GST) Imked to pl6 (17). The GST-pl6 fusion 
was recoveared by binding to glutathione-sepharose beads and the association of each CDK 

10 was analyzed by gel electrophoresis* Consistent with the previous observations, GST-pl6 
bound much more efSSciently to CDK4 than to f:dc2, CDK2 or CDKS. 

Smce the predicted molecular weigh: of pl6 is close to 16 Kd, the idratity of pl6 as 
the CDK4<-associated pi 6 protein found in transformed cell lines (see above) was deteimined. 
Two in vitro translation products of 15 KD aud 17 KD were obtained from the pl6 cDNA, 

15 These products, as well as the CDK4-associated pi 6 protein from HeLa cells were treated 
with N-chlorosuccinimide. The partial NCS -proteolytic pattern of the 17 KD cDNA-derived 
product was very sunilar to the pattern obtained with the CDK4-associated pl6 protein from 
HeLa cells, strongly suggesting that the pi 6 cDNA actually corresponds to pi 6. Partial 
digestion with V8 protease of the 17 KD cDNA-derived product and pl6 also yielded similar 

20 patterns. It is interesting to note that the pi 6 protein overe}q>ressed in insect cells has an 
clectrophoretic mobility of 15 KD, and its NCS proteolytic map is identical to that obtained 
with the 15 KD cDNA derived product. This suggests that the actual pl6 found in human 
cells and the 17 KD in vitro translation product correspond to posttranslationally modified 
proteins. The fact that the pi 6 protein overexpressed in insect cells interacts with CDK4 

25 suggests that this modification is not essentiil for the interaction (see below). 

The identity between pi 6 and the CDK4-associated protein pi 6 was further 
confirmed using antibodies raised against th? purified GST-pl6 fusion protein. Seveml 
human cell Imes were used for this experiment: a normal cell line WI38, derived from normal 
lung fibroblasts; the VA13 cell line derived from WI38 by transformation with the SV40 T- 

30 antigen; and HeLa cells. As set out above, anti-CDK4 immunoprecipitates of WI38 revealed 
the association of CDK4 with cyclm Dl, PCNA, p21 and pl6. In contrast, in VA13 and HeLa 
•cells CDK4 is only associated with pl6. Anti-pl6 immunoprecipitates contained a protein 
with an apparent molecular weight of 16 KD which was readily detectable in the two 
transfonned cell lines, VA13 and HeLa but to a lesser extent in the normal cell line WI38. 

35 This protein not only had the same clectrophoretic mobility as the pi 6 protein 
coinununoprecipitated with anti-CDK4 serum, but also had an identical NCS partial 
proteolytic pattern. In addition to pl6 a protein of 33Kd was observed in anti-pl6 
coimmunoprecipitates that was shown to be identical to CDK4 by V8-proteolytic mapping. 



wo 95/28483 PCT/US95/04636 

-62- 

Northem analysis of the transcripts present in WI38 and VA13 cells indicated that the 
pl6 mRNA was around many times lss& abimdant in WI38 cells compared to VA13 cells. 
This difference approximately corresponded to the observed difference in the amount of pi 6 
protein between the two cell lines, suggesting the possibility that pi 6 expression might be 
5 regulated at a transcriptional or post-transcriptional level. Indeed, in three non-viraUy 
transformed cell lines the expression of pi 6 could not be detected even after overexposure of 
the geL 

To study the biochemical consequences of the interaction of pi 6 with CDK4, active 
CDK4-cyclin D complexes have been reconstituted in vitro by standard protocols (Kato et al. 

10 Genes Der 7:331 (1993); and Ewen et al. Cell 73:487 (1993)). The three relevant 
components, CDK4, pi 6 and cyclm Dl, were expressed in baculovirus-infected insect cells. 
Extracts were prepared from metabolically (^^S)-labeled insect cells that separately 
overexpressed pi 6, CDK4 or cyclin Dl, as well as from cells overexpressing both CDK4 and 
cyclin Dl . In response to increasing amounts of pl6, corresponding decreases in the ability of 

15 CDK4 to phosphorylate RB was observed. This inhibition correlated with the association 
between pi 6 and CDK4 as detected by inununoprecipitation. No inhibition was observed 
when CDK2-cyclm D2 complexes were used in a similar assay. To confirm that the 
inhibition of CDK4 was due to pl6, a His-tagged pl6 fusion protem (His-pl6) was created to 
have an amino terminal extension of 20 amino acids containing a tract of 6 histidine residues. 

20 This fusion protein was overexpressed in baculovinis-infected insect cells, and was purified 
by virtue of the high-affinity association of the histidine tract to nickel-agarose beads. The 
His-pl6 protein preparation was shown to be >90% pure, and inhibited the activity of the 
CDK4-cyclin Dl complex under conditions similar to those used for inhibition by the whole 
lysates. 

25 The role of the retinoblastoma gene product (RB), appears to be as a cell-cycle 

checkpoint which appears to at least act be sequestering transcription factors responsible for 
the proteins of phase. In many carcinomas, p53 function is lost by mutation or deletion. RB, 
on the other hand, is not apparently altered as often. However, because pi 6 down-regulates 
the CDK4/cyclin D complex, which acts to phosphorylate RB, it is proposed herem that pi 6 

30 loss in certain carcinomas can alleviate the effects of the RB checkpoint and, in some manner 
of speaking, rq)resent a checkpoint deficiency analogous to p53 loss. The loss of pl6 would 
-result in more effective phosphorylation of RB and hence would remove the RB-mediated 
inhibition of the cell-cycle. Consistent with this notion, it is described below that in a variety 
of human tumor cells, such as cells which over-express a D-type cyclin, e.g. cyclin Dl or D2, 

35 the pl6 gene is lost from the cell, e.g. homozygously deleted. 

Moreover, a^ describee! in the examples below, the pi 6 gene was found to map to the 
human region 9p21-22, a known melanoma locus (Walker et al. (1994) Oncogene 9:819; 
Coleman et al. (1994) Cancer Res 54:344; Cheng et al. (1993) Cancer Res 53:4761; and 
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Cannon-Alhright et al. (1992) Science 258:1148). The chromosomal mapping was further 
confirmed by analysis of somatic cell hybrids through PCR amplification (using primers 
exl A and exl3 of Figure 2A). Somatic hybrids containing human chromosome 9 resiilted in 
positive PCR products being amplified. 
5 Utilizing primers generated fi-om the cDNA sequence of human pl6 (SEQ ID No. 1) 

which are shown in Figure 1, the genomic pl6 gene was partially sequenced to determine 
intron/exon boundaries. The approximate sequences of the nucleic acid flanking Exon 1 and 
Exon 2 (see Figure 1) are shown in Figures 2A and 2B and 3A-D respectively. 

Genomic DNA was isolated fit)m a variety of human tumor lines (Sambrook et al. 

10 Molecular Cloning: A Laboratory Manual, CSHL Press, Cold Spring Harbor, NY (1989)) 
and was probed by PCR reactions for Ae presence or absence of pi 6 sequaices. In 
particular, primers exlA and exl3 (Figure 2A) were used to score for exon 1 of pl6, and 
primers exl4 and exl5 were likewise used to detect exon 2 of pl6. As shown in Figure 4, the 
pi 6 gene is disrupted in several tumor cell Imes, confirming that pl6 is indeed likely to be 

15 critical in cell transformation in certain cancerous cells. Moreover, probing of these cell lines 
with fiill length pl6 cDNA (SEQ ID No. 1) demonstrated that in at least 3 of those cells 
apparently missing a portion of the pi 6 gene, the entire gene was in fact absent. 

Based on immunoprecipitation e}q)eriments with anti-pl6 antibodies, as well as 
oligonucleotide hybridization ass^s, it became apparent that the pl6 protein represented by 

20 SEQ ID No. 2 is merely one member of a larger family of related cell-<7cle regulatory 
proteins. For instance, even under high stringenqr conditions, Southern hybridization 
e)q)eriments of mRNA fi»m dififerent tissue types has indicated that approximately 4 closely 
rehited transcripts are produced. These pl6 homologs, members of the CCR-protein fimiily, 
may have arisen by gene duplication (e.g. each CCR-protein arises &om a distinct gene) or 

25 fiom alternate exon splicing at the mRNA level, or a combination thereof. 

Utilizing a probe consisting of fte coding region of the human pi 6 gene, we have 
screened a mouse embryonal stem cell library and have isolated a genomic clone containing 
the coding region for a mouse homolog of the human pi 6 gene described above. This clone 
was isolated under low to modoate stringency conditions (1 x SSC at 50''C). This DNA 

30 (14kB) has been cloned in two independent pieces and the restriction map for nine restriction 
endonucleases has been performed (see Figure 5). The mouse CCR-gene has been completely 
•sequenced and the coding region is apparently made up of only two exons that have been 
located in the restriction map by Southern hybridization. The apparent molecular weight of 
the mouse CCR-protein is 13.5kDa, and the nucleic acid and ammo acid sequence of the 

35 mouse CCR-protein, termed herein "pl3.5", is given m SEQ ID No. 5 and 6 respectively. 

Moreover, utilizing degenerate probes based on the most highly conserved sequaices 
between the human pl6 clone and mouse pl3.5 clone (e.g. between residues Met-52 through 
Gly-135 of human pl6 and Met-1 dirough Gly-83 of mouse pl3.5), we have isolated a 
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number of human and mouse pl6 homplogs, such as the mouse plS homolog (SEQ ID Nos. 7 
and 8). 

In addition, it has been noted that TGF-P treatment causes accumulation of RB in the 
imder-phosphorylated state and expression of RB-inactivating viral oncoproteins prevents 
5 TGF-P induced cell cycle arrest (Laiho et. al. (1990) Cell 62:175-185; and Pietenpol et al. 
(1990) Cell exmi'lZSy WWle prior publications have suggested that TGF-p treatment 
results in down-regulation of CDK4 expression (Ewen et al. (1993) Cell 74:1009-1020), the 
data suggested to us that TGF-p might function through suppression of RB phosphorylation 
and pointed to the possibility that one resuh of TGF-P treatment might be inhibition of cyclin 

1 0 dependent kinases. 

AccordmgIy» to investigate the mechanism by v^ch TGF-p inhibits cell proliferation, 
we examined anti-CDK immunoprecipitates from human keratinocytes which had been 
anested in Gl by exposure to TGF-p. Notably, immunoprecipitates of two Gl -specific 
cyclin kinases, CDK4 and CDK6, contained several low molecular weight, associated 

15 proteins. These included pi 6 and two additional proteins of approx. 15 and 15.5 kDa. These 
proteins were not recovered in parallel CDC2 or CDK2 immunoprecipitates but were 
recovered in anti-pl6 inununoprecipitates, suggestmg that pl5, pl5.5 and pl6 might be 
related. This was confirmed by westem blotting of CDK4 and CDK6 immunoprecipitates 
which demonstrated that pl5 and pl5.5 were weakly cross-reactive widi the pl6 antiserum. 

20 To isolate clones encodmg putative pi 6 relatives, we constructed a cDNA library 

from TGF-P arrested HaCaT cells (Boukamp et al. (1988) J Cell Biol 106:761-771) and 
probed this library at low-stringency with the pl6 coding sequence (SEQ ID No. 1). One 
clone obtained in this screen encoded a 137 amino acid protein (predicted M.W. 14.7 kDa.) 
with homology to pi 6. Based upon this homology and upon biochemical properties 

25 described below, we have named this protein pl5. The first 50 amino acids of pi 5 and pl6 
share approx. 44% identity. This is followed by an 81 ammo acid region of approx. 97% 
identity (see Figure 6) after which pi 5 and pi 6 sequences diverge. The sequence of pi 5 can 
be divided into four ankyrin repeats suggesting that this structural motif is conserved in the 
CCR-protein fiamily. In vitro translation of the pi 5 cDNA produced a protem which precisely 

30 comigrated with the pl5 band present in CDK4, CDK6 and pl6 immunoprecipitates from 
TGF-p arrested HaCaT cells. Identity of these proteins was confirmed by protease and 
-chemical cleavage mapping. 

To investigate the fimctional sunilarity of pi 5 and pi 6, we expressed pi 5 as a fiision 
protein in bacteria and tested its ability to bind and inhibit cyclin dependent kinases, pi 5 

35 specifically bound CDK4 and CDK6 but did not appreciably bind CDC2, CDK2 or CDK5. 
To assess the consequences of binding, pi 5 was added to active cyclin/CDK complexes 
expressed in insect cells, pi 5 specifically inhibited the cyclin D/CDK4 and cyclin D/CDK6 
enzymes but had no effect on CDK2/cyclin A kinase. Thus pi 5 is a fimctional member of the 
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CCR-protein fiunily. Moreover, FISH mapping of the pi 5 gene demonstrated that this gaie 
lies adjacent to the pi 6 gene at 9p2 1 . 

While we first noted pi 5 in inunimoprecipitates fi-om HaCaT cells which had been 
arrested in Gl by serum starvation and re-stimulation in the presence of TGF-P, by 
5 comparison, we found that asynchronous, rapidly proliferating HaCaT cells contained 
considerably lower levels of pi 5 in CDK4 and CDK6 immunoprecipitates. To separate 
effects of TGF-P treatment from effects of Gl arrest, asynchronous cultures were treated with 
TGF-P for various periods, after which patterns of CDK4 and CDK6 associated proteins were 
examined. In as little as four hours following TGF-p addition, pl5 levels rose in CDK4 and 

10 CDK6 immunoprecipitates, reaching peak levels after 6-8 hours. In contrast, CDK-associated 
pi 6 levels were unaffected by TGG-p. Northern blotting of RNA from cultures treated in 
parallel revealed tfiat increased CDK4-associated pi 5 reflected increased abundance of pi 5 
mRNA. In 2 hours following TGF-p treatment, pl5 mRNA began to rise and reached a peak 
induction of approx. 30-fold after 6-8 hours. In contrast, pl6 mRNA levels did not vary. 

15 Two other mechanisms for TGF-p mediated cell cycle arrest have been previously 

proposed. In MvlLu cells, TGF-p treatment suppressed CDK4 synthesis. This was deemed 
causal since cells could be rendered resistant to TGF-p by constitutive overexpression of 
CDK4. In HaCaT cells, TGF-p treatment had no efifect on CDK4 protein or mRNA levels. 
Based upon the properties of pi 5, we would predict that CDK4 overexpression could also 

20 render HaCaT cells TGF-p resistant by titrating the pl5 CDK4/CDK6 inhibitor. p27, a CDK 
inhibitor which was purified from TGF-p arrested cells, has also been proposed as a link 
between TGF-p and cell cycle control. However, in HaCaT cells, TGF-p treatment had no 
efifect on p27 mRNA levels. Thus any contribution ibat p27 may make to TGF-P mediated 
cell cycle arrest in these cells must occur by regulation at the post-translational level. 

25 Considered together, our data suggest that pi 5 may function as an effector of TGF-P 

mediated cell cycle arrest via inhibition of CDK4 and CDK6 kinases, pi 5 may be the sole 
mediator of TGF-p induced arrest in some ceUs, or may cooperate with other TGF-p 
responsive pathways^ TGF-p can regulate differentiation in some cell types, and the ability of 
TGF-p to affect cell cycle progression through pi 5 may also contribute to these processes. 

30 Moreover, cytogenetic abnormalities at 9p21 are common in many types of human 

tumors suggestmg the presaice of a tumor stqjpressor gene at fliis locus. An mherited cancer 
syndrome v^ich causes predisposition to melanoma also maps to 9p21 . In addition to our 
data presented herein, and in U.S.SJ>I 08/248,812 and in 08/227,371, pl6 was initially 
proposed as a candidate for both of these activities based upon analysis of pl6 deletions and 

35 point mutations in cell lines. However, the presence of a second functional member of the 
pl6 family at 9p21 raises the possibility that loss of tumor suppression may mvolve 
inactivation of either or both genes. The response of pl6 to viral oncoproteins indicates that it 
may function in intracellular growth regulatory pathways, while results presented here 



I 
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suggest that pi 5 may transduce extacellular growth inhibitory signals. Thus deletions of 9p21 
which remove both genes (or other mutations that might inactivate both) could 
simultaneously negate two major proliferative control pathways. In this regard, the ability of 
TGF-P to induce growth arrest is reduced or lost in many neoplastically transformed cell 
5 lines. In particular, melanocytes are sensitive to growth inhibition by TGF-P, but many 
metastatic melanoma cells are TGF-p resistant. 



Example I 

1 0 Demonstration of Selective Submit Rearrangement of Cell-cycle Complexes In Association 
With Cellular Transformation by a DNA Tumor Virus or Its Oncogenic Product 

(i) Cellular Transformation With DNA Tumor Virus SV40 Is Associated With Subunits 
Rearrangement of Cell-cycle Complexes 

IS Preparation of [^^S] methionine-labelled cell lysates and polyacrylamide gel 

electrophoresis were as described above, as well as described in PCT Publication No. 
WO92/20796. Cell lysates were prepared from either human normal diploid fibroblast cells 
WI38 or DNA tumor vmis SV40 transformed WI38 cells, VA13. Cell lysates were 
immunoprecipitated with antibodies against each cell-cycle gene products. 

20 

(ii) Subunit Rearrangements of Cell-cycle Complexes In Two Different Pair Cell Lines 

Methods for preparation of cell lysates are the same as described above. Two 
different pair cell lines were used in these experiments. HSF43 is a normal human diploid 
fibroblast cell line and CTIO (full name CT10-2C-T1) is a derivative of HSF43 transformed 
25 by SV40 large tumor antigen. CV-1 is an Afiican green monkey kidney cell line and COS-1 
is a derivative of CV-1 transformed by SV40. 

(Hi) Cellular Transformation by DNA Tumor Virus SV40 Is Associated With Rearrangement 
ofPCNA Subunit of Cell-cycle Complexes 
30 Preparation of cell lysate, electrophoresis, and Westem blotting conditions are the 

same as described above. Normal human diploid fibroblast cell lines and their SV40 
transformed cell lines are described above. Immunoprecipitates derived from each antibody 
were separated on polyacrylamide gels and blotted with anti-PCNA antibody. 

35 (iv) Cellular Transformation by DNA Tumor Virus SV40 Is Associated With Rearrangement 
ofCDK4 Subunit of Cell-cycle Complexes 

Preparation of cell lysate, electrophoresis, and Westem blotting conditions are the 
same as previously described. Normal human diploid fibroblast cell lines and their SV40 
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transfonned cell lines are described above. Immiinoprecipitates derived from each antibody 
were separated on polyacrylamide gels and blotted with anti-CDK4 antibody. 

S Example 2 

Cloning of pi 6, an inhibitor of CDK4 activity 

(i) Cloning of pi 6 using the two hybrid assay 

Saccharomyces cerevisiae YPB2 cells were transfonned simultaneously with a 
10 plasmid containing a GAL4db-pl6 fusion and with a plasmid containing, respectively, the 
GAL4ad fused to cdc2 (CDKl), CDK2, CDK4, CDK5, PCNA (proliferating cell nuclear 
antigen), and the fission yeast kmase Snf 1 . After growing cells in medium selective for both 
plasmids (minus tryptophan and minus leucine), two colonies were picked randomly and 
were streaked in plates that either contained or lacked histidine. The ability to grow in the 
IS absence of histidine depends on the e;q>ression of the HIS3 gene that is under a GAL4- 
responsive promoter and, therefore, indicates that a functional GAL4 activator has been 
reconstituted through the interaction of pl6 with the conesponding target protein. 

(ii) Interaction of pi 6 CDKs 

20 Purified bacterially-produced GSTpl6 fusion protein was mixed with (35S)-labeled in 

vitro translated cdc2, CDK2, CDK4 and CDKS. Mixtures contamed O.S ^ig of purified GST- 
pl6 and an equivalent amount of in vitro translated protein (between O.S to S ,jil; TNT 
Promega) m a fmal volume of 200 fil of a bufifer containing 50 mM Tris-HCl pH 8, 120 mM 
NaCl and 0.5% Nonidet P-40. After 1 h at 4®C, 15 ^il of glutathione-agarose beads were 

25 added and incubation was resumed for an additional hour. Beads were recovered by 
centrifugation, washed 4 tunes with the incubation bufier, and mixed with standard protein- 
gel loading buffer. Samples were loaded into a 15% poly-acryllamide gel and (^^S>labeled 
proteins were detected by fluorography. The GSTpl6 fusion protein was overexpressed m Ae 
pGEX-KG vector and purified by standard techniques. The in vitro translation templates were 

30 derived &om the pBluescript vector (Stratagene). 

(Hi) Proteolytic mapping of pi 6 

The in vitro translated (35S)-labeled pl6 (TNT Promega) was obtained using the pl6 
cDNA cloned into pBluescript vector (Stratagene) as a template, and the CDK4-associated 
35 pi 6 protein was co-immunoprecipitated with an anti-CDK4 serum fiom metabolically (^^S)- 
labeled HeLa cells lysates. Partial proteolysis was done over the corresponding gel slices 
after extensive equilibration in a buffer and digestion was accomplished by addition of NCS 
at different concentrations. The products were run in a 17.5% polyacrylamide gel and 
detected in a phosphoimager Fujix 2000. 
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(iv) Detecting the effects of pi 6 on CDK4-cyclin D complexes 

Baculovirus-infected insect cells overexpressing pi 6, CDK4, cyclin Dl, or both 
CDK4 and cyclin Dl together were metabolically (35S)-labeled. The different incubation 
5 mixtures were composed by extracts containing pi 6, CDK4, cyclin Dl and both CDK4 and 
cyclin Dl, and were immimoprecipitated with anti-pl6 serum, anti-CDK4 serum without any 
previous preincubation, and anti-CDK serum preincubated with the peptide originally used to 
raise the antiserum and anti-cyclin Dl serum. Immunoprecipitates were then analyzed by 
SDS-PAGE. 

10 

Chromosomal Mapping of pi 6 



Genomic clones of the human pi 6 gene were isolated by stringency screening (dS^'C 
15 wiih O.lxSSC wash) of a APDOI human genomic library (Strategene) with cDNA probes. 
Isolated phage clones were confirmed by high stringency Soutfaem hybridization and/or 
partial sequence analysis. Purified whole phage DNA was labelled for fluorescent in situ 
hybridization (FISH) analysis. 

FISH analysis was perfomied using established methods (Demetrick et al. (1994) 
20 Cytogenet Cell Genet 66:72-74; Demetrick et al. (1993) Genomics 18:144-147; and DeMarini 
et al. (1991) Environ Mol Mutagen 18:222-223) on methotrexate or thymidine synchronized, 
phytohemagglutinin stimulated, normal peripheral blood lymphocytes. Suppression with a 
mixture of sonicated human DNA and cotl DNA was required to reduce the background. 
The stained slides were counterstained with propidiem iodide (for an R banding pattem) or 
25 DAPI and actinomycin D (for a DA-DAPI banding pattern), mounted in antifade medium and 
visualized utilizing a Zeiss Axiphot microscope. Between 30 and 100 mitotises were 
examined for each gene location. Photographs were taken using a cooled CCD camera. 
Alignment of three color fluorescence was done under direct visualization through a triple 
bandpass filter (FITC/Texas Red/DAPI). The pl6 gene was visualized to map to 9p21-22. 

30 



35 Example 4 

Cloning Of Mouse CCR-proteins 

Utilizing a probe consisting of the coding region of the human pi 6 gene, we have 
screened a mouse embryonal stem cell libraiy and have isolated a genomic clone containing 
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the coding region for a mouse homolog of the human pi 6 gene described above. This clone 
was isolated under low to moderate stringency conditions (1 x SSC at SO'^C). This DNA 
(14kB) has been cloned in two independent pieces and the restriction map for nine restriction 
endonucleases has been performed (see Figure 5). The mouse CCR-gene has been 
5 completely sequenced and the coding region is apparently made up of only two exons that 
have been located in the restriction map by Southern hybridization. The apparent molecular 
weight of the mouse CCR-protein is 13.5kDa, and the nucleic acid and amino acid sequence 
of the mouse pl6 homolog, termed herein "pl3.5", is given in SEQ ID No. 5 and 6 
respectively. 

10 Utilizing a probe based on the nucleotide sequence of the conserved region of Metl- 

ArgSO (see Figure 6) of the mouse pi 3.5 clone, a pi 9 embryonal carcinoma library 
(Stratagene) was probed under high stringency conditions. Several clones were isolated fiom 
the library which were significantly homologous to the pi 3.5 clone. Sequence comparison of 
one of the murine clone, the sequence for which is provided in SEQ ID Nos. 7 and 8, 

1 5 indicated that it was the murine homolog of pi 5 (described in Example 5 below). 

Examk 5 
Cloning of pi 5 from human cells 

20 The sequence of the pi 5 cDNA is shown along with the deduced amino acid sequence 

of the protein in SEQ ID Nos. 3 and 4 respectively. The deduced amino acid sequence of pl5 
was compared to that of pl6 (e.g. see Figure 6), and areas of homology were identified using 
the BLAST program. 

25 0) Cell Culture. 

HaCaT cells were routinely maintained m DMEM containing 10% fetal bovine serum 
(FBS) ( Boukamp et al. (1988) J Cell Biol 106:761-771). For TGF-p arrest, HaCaT ceUs 
were grown to confluence in DMEM containing 10% FBS and then serum starved for 3 days 
in DMEM containing 0.1% FBS. Cells were re-stimulated by addition of new media 

30 containing 10% FBS and 2 ng/ml TGF-p (purified from human platelets, Calbiochem). For 
library construction, RNA was prepared 22 hours after re-stunulation. For 
immunoprecipitations experiments, cells were labelled with 35s-methionine in the presence 
of TGF-P for four hours beginning at 19 hours after re-stimulation. 

35 (li) Library Construction and Screening. 

RNA was prepared firom TGF-p treated cells and from cells that were serum starved 
and tiien re-stimulated in the absence of TGF-p using RNAZOL B according to the 
manufiurturer's instructions. The cDNA library which was used to isolate the pi 5 clone was 
constructed from a mixture of RNA derived ftom treated and untreated cells. Messenger 
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RNA preparation and cDNA synthesis were exactly as previously described (Hannon et al. 
(1993) Genes Dev. 7:2378-2391). Double-stranded cDNA was ligated into X-ZapII aims 
according to the manufacturers instructions. Low stringency hybridization was performed at 
50X in 500 mM NaP04, pH 7,0, 1 mM EDTA, 15% Formamide, 7% SDS, 0.1% bovine 
5 serum albumin (wash: 1 x SSC, 50°C). The pi 5 cDNA was also isolated from a human 
mammary epithelial cell library. Cell lysis, immunoprecipitation and protease mapping were 
performed exactly as previously described (Xiong et al. (1993) Nature 366:701-704) For 
chymotrypsin digestion, either 7|ig or L5|ig of enzyme was used. Gels for chymotrypsin 
mapping contained 0.1% SDS. 

10 

Example 6 

pis is a specific inhibitor of CDK4 and CDK6 

Preparation of GST-pl6 fiision protein rom bacteria is described above. GST-pl5 
1 5 was prepared identically. 

(i) pis binds CDK4 and CDK6 

In vitro translated CDKs were prepared using the TNT-lysate in vitro translation kit 
(Promega) according to the manufacturer's instructions. For the binding assay, GST-pl5 or 

20 GST-pl6 (250 ng) was incubated with in vitro translated CDKs, e.g. 35s-labelled CDC2, 
CDK2, CDK4, CDKS or CDK6, for 30 minutes at 30°C in 30 \i\ containing 20 mM Tris, pH 
8.0, 10 mM MgCl2, 1 mM EGTA. Following mcubation, mixtures were diluted to 250 ^il in 
IP buffer (50 mM Tris, pH 8.0, 150 mM NaCl, 0.5% NP^O) and incubated for 1 hour with 
12.5^1 of glutathione sepharose. Bound proteins were recovered on glutathione sepharose, 

25 washed four times with 1 ml of IP buffer, and then released by boiling in SDS gel sample 
buffer. Bound proteins were analyzed by electrophoresis in a 17.5% PAGE gel. For 
comparison, a simUar experiment using GSl -pi 6 was carried out. 

Cii) pis binding inhibits cyclin D/CDK4 kinase 

30 Active cyclin D/CDK4 kinase, present in lysates of baculovirus infected insebt cells, was 
incubated with increasing quantities of GST-pl5 for 30 minutes at 30®C. Preparation of 
baculovirus lysates contaming active cyclin D/CDK4 kinase is described above. Lysates 
containing active cyclin D/CDK6 were prepared identically. For kinase assays lO^il of 
baculoviral lysates were mixed with approximately 0, lOng, 20ng, 50ng, or lOOng of GST- 

35 pi 5 or GST-pl6 and incubated for 30 minutes at 30°C in a total volume of 30^1 containing 
20 mM Tris, pH 8.0, 10 mM MgCl2, l>nM EGTA. Following incubation, Q,S\ig GST-RB 
and 0.5 ^1 of y-32p.ATP (5 jilCi, 3000 Ci/mMol) were added for 10 minutes at 30^C as 
substrates. Reactions were stopped by the addition of 250^1 of IP buffer and 15fil of 
glutathione sepharose. After a further 1 hour of incubation at 4''C, beads were washed four 
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times with IP buffer before release of bound proteins and subsequent electrophoresis. Like 
pi 6, the pl5-GST fusion protem was able to inhibit the kinase activity of CDK4. 

(in) pi 5 inhibits cyclin D/CDK6 kinase 
5 Cyclin D/CDK6 complexes were produced in baculovirus infected insect cells and 

incubated with increasing quantities of GST-pl5 or GST-pl6 as described above. Following 
incubation, the ability of these complexes to catalyze GST-RB phosphorylation was 
determined. Both pl5 and pl6 demonstrated an ability to inhibit the CDK6 kinase activity. 

10 Example 7 

TGF-P treatment increases association of pi 5 with CDK4 and CDK6 

HaCaT cells were cultured as described above. TGF-P treatment was initiated by 
adding TGF-p to existing media to 2ng/nil. Fo r the last two hours of TGF-p treatment, cells 

15 were labelled with ^^S-methionine. For cell labelling, media was changed to DMEM minus 
methionine (Gibco-BRL) containing 2ng/ml of TGF-p, 10% dialyzed FES and 0.5 mCi/ml of 
35S-methionine/cysteine (trans-label, NEN). Cells were lysed and proteins were 
incununoprecipitated using either anti-CDK4 or anti-CDK6 antibodies. Cell lysis and 
immunoprecipitation were as described above. HaCaT cultures treated with TOF-p in 

20 parallel were stamed with DAPI and analyzed by FACS. For 8 hours following TGF-p 
addition, there was no appreciable change in the percentage of Gi cells. After fourteen hours, 
the Gi population began to increase. Arrest of HaCaT cells in Gj (approx. 85% Gi cells) 
required at least 24 hours after treatment of an asynchronous culture. 

25 Example S 

TGF-P treatmcr t induces pi 5 mRNA 

Asynchronous HaCaT cultures were treaied with TGF-p for the indicated times. Total 
RNA was prepared from treated cells and used for Northern blotting with a probe specific for 

30 pi 5 (e.g. probes consisted of the first coding exons of this genes and was prepared by PCR). 
Hybridization used for northem blots consisted of 200 mM NaP04, pH 7.0, 1 mM EDTA, 
15% Foimamide, 7% SDS, 0.1% bovine serum albumm at 65°C; wash: 0.2xSSC, 65^C. The 
"pl5 probe recognized fliree pl5 mRNAs of approximately 0.8, 2.2 and 3.2 Kb. Northem 
signals were quantitated on a Fuji BAS2000 phosphorimager and plotted to give a graphical 

35 representation of the results. RNA amoimts were nonnalized by mass, and over-probing of 
Ae blot with a human actin probe suggested no more than 10% variance m RNA amounts 
between lanes. 

RNAs identical to those used in pi 5 probe panel were also probed with a fragment 
specific for pl6. Furthermore, RNA from TGF-p treated HaCaT cells was probed 
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oligonucleotides derived from either a fiiagment of human CDK4 coding sequence or a 
fragment of the p27 cDNA. 



S Example 9 

Generating a Transgenic Mouse pi 3, 5 Knockout 

The disruptive construct is formed by the two DNA regions of approx. 3-4Kb flanking 

the pi 3.5 gene. These DNA pieces are cloned at both sides of a gene marker that will be used 
10 to select the mouse embryonal stem (ES) cells that have incorporated this DNA after 

transfection. Regions which are homologous to the pi 6 locus are in turn flanked by another 

marker which allows selection against cells which have incorporated the disruption vector by 

non-homologous recombination (e.g. at a locus other than that of the mouse pl3,5 gene). 

Those cells where insertion has occurred in the appropriate position are injected into mouse 
IS blastocytes and implanted into the appropriate female mice following standard protocols 

{Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 

Hari)or, N. Y., 1986; and Jaenisch (1 988) Science 240: 1468-1474.) 

Chimeric pups resulting from the engineered blastocysts can be identified by a coat 

color marker specific to the transfected ES cells (agouti). Mice with high degrees of 
20 chimerism are crossed to identify those with chimeric germ lines and to generate non- 

chimeric heterozygous disruptants. Homozygous disruptants are derived by breeding the 

non-chimeric hetero2^gotes. 



25 Characterization of CCR-insensitive CDK4 Mutants 

A mutant of CDK4 was discovered in cells from a melanoma patient, and the 
sequence of the mutant enzyme was determined. From the sequence, it was determined that 
that the gene contamed a Arg^Cys at position 24 (see SEQ ID No. 9). 

30 We leisolated the mutant by cloning CDK4 (see Matsushime et al. (1992) Cell 

71:323-334) and generated a the cysteine mutation by oligonucleotide primer mutagenesis. 
To characterize the effect of the mutation, we compared the mutant and wild-type enzyme 
based on a number of different criteria, including intrinsic activity (e.g. did the mutant 
constitutively activate CDK4), as well as the ability of other regulatory proteins to control 

35 CDK4 activation. Briefly, we generated a series of baculovirus ^pression systeins for over- 
expressing various proteins. In particular, Sf? cell lysates (Desai et al. (1992) Mol Cell Biol 
3:571-582, and Example 6 above) were obtained for mutant and wild-type CDK4, cyclin Dl, 
pl6, pl5, p21 and p27 (see Polyak et al. (1994) Genes Dev 8:9-22; and Toyoshima et al. 
(1994) Cell 78:67-74). Using a GST-RB fusion protein (Example 6) as a substrate for 
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detecting CDK4 kinase activity, various combinations of lysate were lixed and tested for 
CDK4 activation/inhibition. 

When the mutant CDK4 was expressed alone in Sf9 cells, no appreciable 
phosphorylation of the RB substrate was detected, as is also the case with the wild-type 
5 enzyme, indicatmg that the mutation did cause constitutive acitvation of CDK4. 
Overexpression of a CDK4 and cyclin Dl in an Sf9 lysate was also identical for both mutant 
and wild-type kinase, as each was shown to be activated in the presence of cyclin Dl. 
However, upon addition of increasing amounts of either pl6- or pl5-containing lysate to the 
CDK4/cyclin D mixture, the wild-type CDK4 was inhibited yet the mutant CDK4 was 

10 relatively unaffected, indicating that the mutation gave rise to kinase \^ose activity is 
insensitive to ei&er pl5 or pl6. Furthermore, immunoprecipitation demonstrated that neither 
pis or pi 6 were capable of binding the mutant, as they were apparently Ibist from the 
complex which is ordinarily seen with the wild-type CDK4. Fmally, similar experiments 
carried out with p21 and p27 indicated that the particular mutation, Arg24-Cys, did not effect 

15 the binding or inhibitory ability of either of those proteins. An analogous mutation to Arg3 1 
of CDK6 (SEQ ID No. 10; and Bates et al. (1994) Oncogene 9:71-79 for the wUd-type gene) 
is expected to have the same effect. 

Utilizing the Arg-24 residue as a reference point, we have further identified by 
molecular modeling other residues which may also be involved in the recognition of pl6/pl5. 

20 Utilizing the coordinates for CDK2 (DeBondt et al. (1993) Nature 363:595-602; Endicott et 
al. (1994) Prot Eng 7:243-253; and Morgan et al. (1 994) Cwr Opin Cell Biol 6:239-246)) we 
have constructed a model for CDK4. Focusing our attention on residues in the spatial 
vicinity of Arg-24 and tiiat are conserved between CDK4 and CDK6 (but different from 
CDK2 or CDC2, we have recombinantly generated and analyzed a number of new CDK4 

25 mutants for their ability to bind pl6. These mutants and their pl6-binding abilities are 
summarized in Table 1 below (see also Figure 7). 

Three changes abolished the interaction with pi 6. When these changes were 
visualized onto the 3-dimensional structure, it was apparent that these residues form a cluster 
of four amino acid residues accessible to solvent These residues, K22, R24, H95 and D97 

30 define a surface in the small lobe of CDK4, in very close proximity to the ATP bindmg site, 
but far away from the cyclin binding site or Ae substrate binding site. This surface likely 
represents at least a portion of the pl6/pl5-recognition surface present in CDK4 (and 
homologously in CDK6). Accordingly, an attractive model for pl6/pl5 inhibition of 
CDK4/CDK6 provides an occlusion or distorting effect to the ATP-bmding site upon bmding 

35 of the CCR protein such that ATP either does not bind to CDK4 or is not properly positioned 
to be used as a phosphate donor. 



Table 1 

pi 6 binding to CDK4 mutants 
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residue # 


conservation 


mutation 


pi 6 binding 


7 


CDK4/CDK6 specific 


E^Q 


no effect 


10 


CDK4/CDK6 specific 


A->E 


no effect 


11 


CDK4/CDK6 specific 


E->K 


no effect 


22 


conserved in a CDKs 


K->A 


no binding to pi 6 


24 


conserved in a CDKs 


R->S 


no binding to pi 6 


25 


CDK4/CDK6 ^ecific 


D->N 


no effect 


31 


CDK4/CDK6 specific 


F->V 


no effect 


78 


CDK4/CDK6 specific 


C^I 


no effect 


81 


CDK4/CDK6 specific 


S^E 


no effect 


82 to 86 


CDK4/CDK6 specific 


RTDRE^N 


no effect 


95 to 97 


CDK4/CDK6 specific 


HVD^FLH 


no binding to pi 6 



5 All of the above-cited references and publications are hereby incorporated by 

reference. 



Eqtuvalenis 

10 Those skilled in the art wiU recognize, or be able to ascertain using no more than 

routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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(1) GENERAL INFORMATION: 

' (i) APPLICANT: 

(A) NAME: Cold Spring Harbor Laboratory 

(B) STREET: One Bxmgtown Road 

(C) CITY: Cold Spring Harbor 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 11724 

(ii) TITLE OF INVENTION: Cell -Cycle Regulatory Proteins, and Uses 
Related Thereto 

(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: AscII (text) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/346,147 

(B) FILING DATE: 29-NOV-1994 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/306,511 

(B) FILING DATE: 14-SEP-1994 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/248,812 

(B) FILING DATE: 2S-MAY-1994 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/227,371 

(B) FILING DATE: 14 -APR- 1994 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B.) LOCATION: 41.. 508 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
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CGGAGA6GGG GAGAACAGAC AACGGGCGGC GGGGAGCAGC ATG GAT CCG GCG GCG 55 

Met Asp Pro Ala Ala 
1 5 

5 

GGG AGC AGC ATG GAG CCT TCG GCT GAC TGQ CTG GCC ACG GCC GCG GCC 103 
Gly Ser Ser Met Glu Pro Ser Ala Asp Trp Leu Ala Thr Ala Ala Ala 
10 15 20 

10 CGG GGT CGG GTA GAG GAG GTG CGG GCG CTG CTG GAG GCG GTG GCG CTG 151 
Arg Gly Arg Val Glu Glu Val Arg Ala Leu Leu Glu Ala Val Ala Leu 

25 30 35 

CCC AAC GCA CCG AAT AGT TAC GGT CGG AGG CCG ATC CAG GTC ATG ATG 199 
IS Pro Asn Ala Pro Asn Ser Tyr Gly Arg Arg Pro lie Gin Val Met Met 
40 45 SO 

ATG GGC AGC GCC CGA GTG GCG GAG CTG CTG CTG CTC CAC GGC GCG GAG 247 
Met Gly Ser Ala Arg Val Ala Glu Leu Leu Leu Leu His Gly Ala Glu 
20 55 60 65 

CCC AAC TGC GCC GAC CCC GCC ACT CTC ACC CGA CCC GTG CAC GAC GCT 295 
Pro Asn Cys Ala- Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp Ala 
70 75 80 85 

25 

GCC CGG GAG GGC TTC CTG GAC ACG CTG GTG GTG CTG CAC CGG GCC GGG 343 
Ala Arg Glu Gly Phe Leu Asp Thr Leu Val Val Leu His Arg Ala Gly 
90 95 100 

30 GCG CGG CTG GAC GTG CGC GAT GCC TGG GGC CGT CTG CCC GTG GAC CTG 391 
Ala Arg Leu Asp Val Arg Asp Ala Trp Gly Arg Leu Pro Val Asp Xieu 
105 110 115 

GCT GAG GAG CTG GGC CAT CGC GAT GTC GCA CGG TAC CTG CGC GCG GCT 439 
35 Ala Glu Glu Leu Gly His Arg Asp Val Ala Arg Tyr Leu Arg Ala Ala 
120 125 130 

GCG GGG GGC ACC AGA GGC AGT AAC CAT GCC CGC ATA GAT GCC GCG GAA 487 
Ala Gly Gly Thr Arg Gly Ser Asn His Ala Arg lie Asp Ala Ala Glu 
40 135 140 145 

GGT CCC TCA GAC ATC CCC GAT TGAAAGAACC AGAGAGGCTC TGAGAAACCT 538 
Gly Pro Ser Asp lie Pro Asp 
150 155 

45 

CGGGAAACTT AGATCATCAG TCACCGAAGG TCCTACAGGG CCACTUICTGC CCCCGCCACA 598 
ACCCACCCCG CTTTCGTAGT TTTCATTTAG AAAATAGAGC TTTTAAAAAT GTCCTGCCTT 658 
50 TTAACGTAGA TATAAGCCTT CCCCCACTAC CGTAAATGTC CATTTATATC ATTTTTTATA 718 
TATTCTTATA AAAATGTAAA AAAGAAAAAC ACCGCTTCTG CCTTTTCACT GTGTTG6AGT 778 
TTTCTGGAGT GAGCACTCAC GCCCTAAGCG CACATTCATG TGGGCATTTC TTGCGAGCCT 838 

55 

CGCAGCCTCC GGAAGCTGTC GACTTCATGA CAAGCATTTT GTGAACTAGG GAAGCTCAGG 898 



6GGGTTACTG GCTTCTCTTG AGTCACACTG CTAGCAAATG GCAGAACCAA AGCTCAAATA 958 
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AAAATAATUIT TATTTTCATT CATTCACTCA AAAAAA 994 

S (2) INFORMATION FOR SEQ ID N0:2: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

15 

Met Asp Pro Ala Ala Gly Ser Ser Met Glu Pro Ser Ala Asp Trp Leu 
15 10 15 

Ala Thr Ala Ala Ala Arg Gly Arg Val Glu Glu Val Arg Ala Leu Leu 
20 20 25 30 

Glu Ala Val Ala Leu Pro Asn Ala Pro Asn Ser Tyr Gly Arg Arg Pro 
35 40 45 

25 He Gin Val Met Met Met Gly Ser Ala Arg Val Ala Glu Leu Leu Leu 
50 55 60 

Leu His Gly Ala Glu Pro Asn Cys Ala Asp Pro Ala Thr Leu Thr Arg 
65 70 75 80 

30 

Pro Val His Asp Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu Val Val 
85 90 95 

Leu His Arg Ala Gly Ala Arg Leu Asp Val Arg Asp Ala Trp Gly Arg 
35 100 105 110 

Leu Pro Val Asp Leu Ala Glu Glu Leu Gly His Arg Asp Val Ala Arg 
115 120 125 

40 Tyr Leu Arg Ala Ala Ala Gly Gly Thr Arg Gly Ser Asn His Ala Arg 
130 135 140 

He Asp Ala Ala Glu Gly Pro Ser Asp He Pro Asp 
145 150 155 

45 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 850 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 



10 
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(A) NAME/KEY: CDS 

(B) LOCATION: 338.. 751 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GCGGGGCAGT GAGGACTCCG CGACGGTCC6 CACCCTGCGG CCAGAGCGGC TTTGAGCTCG 60 

GCTGCTTCCG CGCTAGGCGC TTTTTCCCAG AAGCAATCCA GGCGCGCCCG CTGGTTCTTG 120 

AGCGCCAGGA AAAGCCCGGA GCTAACGACC GGCCGCTCGG CACTGCACGG GGCCCCAAGC 180 

CGCAGAAGAA GGACGACGGG AGGGTAATGA AGCTGAGCCC AGGTCTCCTA GGAAGGAGAG 240 

IS AGTGCGCCGG AGCAGCGTGG GAAAGAAGGG AAGAGTGTCG TTAAGTTTAC GGCCAACGGT 300 

GGATTATCCG GGCCGCTGCG CGTCTGGGGG CTGCGGA ATG CGC GAG GAG AAC AAG 355 

Met Arg Glu Glu Asn Lys 
1 5 

20 

GGC ATG CCC AGT GGG GGC GGC AGC GAT GAG GGT CTG GCC AGO GCC GCG 403 
Gly Met Pro Ser Gly Gly Gly Ser Asp Glu Gly Leu Ala Ser Ala Ala 
10 15 20 

25 GCG CGG GGA CTA GTG GAG AAG GTG CGA CAG CTC CTG GAA GCC GGC GCG 451 
Ala Arg Gly Leu Val Glu Lys Val Aarg Gin Leu Leu Glu Ala Gly Ala 

25 30 35 

GAT CCC AAC GGA GTC AAC CGT TTC GGG AGG CGC GCG ATC CAG GTC ATG 499 
30 Asp Pro Asn Gly Val Asn Arg Phe Gly Arg Arg Ala lie Gin Val Met 
40 45 50 

ATG ATG GGC AGC GCC CGC GTG GCG GAG CTG CTG CTG CTC CAC GGC GCG 547 
Met Met Gly Ser Ala Arg Val Ala Glu Leu Leu Leu Leu His Gly Ala 
35 55 60 65 70 

GAG CCC AAC TGC GCA GAC CCT GCC ACT CTC ACC CGA CCG GTG CAT GAT 595 
Glu Pro Asn Cys Ala Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp 
75 80 85 



40 



GCT GCC CGG GAG GGC TTC CTG GAC ACG CTG GTG GTG CTG CAC CGG GCC 643 
Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu Val Val Leu His Arg Ala 
90 95 100 



45 GGG GCG CGG CTG GAC GTG CGC GAT GCC TGG GGT CGT CTG CCC GTG GAC 691 
Gly Ala Arg Leu Asp Val Arg Asp Ala Trp Gly Arg Leu Pro Val Asp 
105 110 115 

TTG GCC GAG GAG CGG GGC CAC CGC GAC GTT GCA GGG TAC CTG CGC ACA 739 
SO Leu Ala Glu Glu Arg Gly His Arg Asp Val Ala Gly Tyr Leu Arg Thr 
120 125 130 

GCC ACG GGG GAC TGACGCCAGG TTCCCCAGCC GCCCACAACG ACTTTATTTT 791 
Ala Thr Gly Asp 
55 135 



CTTACCCAAT TTCCCACCCC CACCCACCTA ATTCGATGAA GGCTGCCAAC GGGGAGCGG 



850 
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(2) INFORMieiTION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Arg Glu Glu Asn Lys Gly Met Pro Ser Gly Gly Gly Ser Asp Glu 
1 5 10 15 

Gly Leu Ala Ser Ala Ala Ala Arg Gly Leu Val Glu Lys Val Arg Gin 
20 25 30 

Leu Leu Glu Ala Gly Ala Asp Pro Asn Gly Val Asn Arg Phe Gly Arg 
35 40 45 

Arg Ala He Gin Val Met Met Met Gly Ser Ala Arg Val Ala Glu Leu 
50 55 60 

Leu Leu X^u His Gly Ala Glu Pro Asn Cys Ala Asp Pro Ala Thr Leu 
€5 70 75 80 

Thr Arg Pro Val His Asp Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu 
85 90 95 

Val Val Leu His Arg Ala Gly Ala Arg Leu Asp Val Arg Asp Ala Trp 
100 105 110 

Gly Arg Leu Pro Val Asp Leu Ala Glu Glu Arg Gly His Arg Asp Val 
115 120 125 

Ala Gly Tyr Leu Arg Thr Ala Thr Gly Asp 
130 135 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 853 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 213.. 567 



(xi) 



SEQX7ENCE DESCRIPTION: SEQ ID NO: 5: 



10 
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G6AGTACAGC AGCG6GA6CA TGGGTC6CAG GTTCTTGGTC ACTGTAAGGA TTCAGCGCGC 60 

GGGCCGCCCA CTCCAAGAGA GGGTTTTCTT GGTGAAGTTC GTGCGATCCC GGAGACCCAG 120 

GACAGCGAGC T6CGCTCTGG CTTTCGTGAA CATGTTGTT6 AGGCTAGAGA GGATCTTGAG 180 

AAGAGGGCCG CACCGGAATC CTGGACCA6G TG ATG ATG AT6 GGC AAC GTT CAC 233 

Met Met Met Gly Asn Val His 
1 5 

GTA 6CA GCT CTT CTG CTC AAC TAC GOT GCA GAT TCG AAC TGC GAG GAC 281 
Val Ala Ala Leu Leu Leu Asn Tyr Gly Ala Asp Ser Asn Cys Glu Asp 
10 15 20 

IS CCC ACT ACC TTC TCC CGC CCG GT6 CAC GAC GCA GCG CGG GAA GGC TTC 329 
Pro Thr Thr Phe Ser Arg Pro Val His A3j? Ala Ala Arg Glu Gly Phe 
25 30 35 

CTG GAC ACG CTG GTG GTG CTG CAC GGG 7CA GGG GCT CGG CTG GAT GTG 377 
20 Leu Asp Thr Leu Val Val Leu His Gl> Ser Gly Ala Arg Leu Asp Val 
40 45 50 55 

CGC GAT GCC T6G GGT CGC CTG CCG CTC GAC TTG GCC CAA GAG CGG GGA 425 
Arg Asp Ala Trp Gly Arg Leu Pro Leu Asp Leu Ala Gin Glu Arg Gly 
25 60 65 70 

CAT CAA GAC ATC GTG CGA TAT TTG CGT TCC GCT GGG TGC TCT TTG TGT 473 

His Gin Asp He Val Arg Tyr Leu Arg Ser Ala Gly Cys Ser Leu Cys 
75 80 85 



30 



SO 



TCC GCT GGG TGG TCT TTG TGT ACC GCT GGG AAC GTC GCC CAG ACC GAC 521 
Ser Ala Gly Trp Ser Leu Cys Thr Ala Gly Asn Val Ala Gin Thr Asp 
90 95 100 



3S GGG CAT AGC TTC AGC TCA AGC ACG CCC AGG GCC CTG GAA CTT CGC GGC 569 
Gly His Ser Phe Ser Ser Ser Thr Pro Arg Ala Leu Glu Leu Arg Gly 
105 110 115 

CAA TCC CAA GAG CAG AGC TAAATCCGCC TCAGCCCGCC TTTTTCTTCT 617 
40 Gin Ser Gin Glu Gin Ser 
120 125 

TAGCTTCACT TCTAGCGATG CTAGCGTGTC TAGCATGTGG CTTTAAAAAA TACATAATAA 677 

4S TGCTTTTTTT GCAATCACGG GAGGGAGCAG AGGGAGGGAG CAGAAGGAGG GAGGGAGGGA 737 

GGGAGGGACC TGGACAGGAA AGGAATGGCA TGAGAAACTG AGCGAAGGCG GCCGCGAAGG 797 

GAATAATGGC TGGATTGTTT AAAAAAATAA AATAAAGATA CTTTTTAATUV TGTCAA 853 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHZ^CTERISTICS : 
55 (A) L^GTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOIiECXJLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 6: 

Met Met Met Qly Asn Val His Val Ala Ala Leu Leu Leu Asn Tyr Gly 
15 10 15 

Ala Asp Ser Asn Cys Glu Asp Pro Thr Thr Phe Ser Arg Pro Val His 
20 25 30 

Asp Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu Val Val Leu His Gly 
35 40 45 

Ser Gly Ala Arg Leu Asp Val Arg Asp Ala Trp Gly Arg Leu Pro Leu 
50 55 60 

Asp Leu Ala Gin Glu Arg Gly His Gin Asp lie Val Arg Tyr Leu Arg 
65 70 75 80 

Ser Ala Gly Cys Ser Leu Cys Ser. Ala Gly Trp Ser Leu Cys Thr Ala 
85 90 95 

Gly Asn Val Ala Gin Thr Asp Gly His Ser Phe Ser Ser Ser Thr Pro 
100 105 110 

Arg Ala Leu Glu Leu Arg Gly Gin Ser Gin Glu Gin Ser 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 91.. 480 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTCCAGAGGG GAGGAGCCGC TCAGAGACCA GGCTGTAGCA ATCTCACGGC CGGCGAAGGA 60 

CCATTTCTGC CACAGACCGG GGACAAGGGC ATG TTG GGC GGC AGC AGT GAC GCG 114 

Met Leu Gly Gly Ser Ser Asp Ala 
1 5 

GGC CTG QCC ACC GCC GCG GCG CGG GGG CAA GTG GAG ACG GTG CGG CAG 162 
Gly Leu Ala Thr Ala Ala Ala Arg Gly Gin Val Glu Thr Val Arg Gin 
10 15 20 
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CTC CTG GAA GCC GGC GCA GAT CCC AAC GCC CTG AAC CGC TTC GGG AGG 210 
Leu Leu Glu Ala Gly Ala Asp Pro Asn Ala Leu Asn Arg Phe Gly Arg 
25 30 35 40 

S CGC CCA ATC CAG 6TC ATG ATG ATG GGC AGC GCC AGG GTG GCA GAG CTG 258 
Arg Pro He Gin Val Met Met Met Gly Ser Ala Arg Val Ala Glu Leu 
45 50 55 

CTG CTG CTC CAC GGA GCA GAA CCC AAC TGC GCC GAC CCT GCC ACC CTT 306 
10 Leu Leu Leu His Gly Ala Glu Pro Asn Cys Ala Asp Pro Ala Thr Leu 
60 65 70 

ACC AGA CCT GTG CAC GAC GCA GCT CGG GAA GGC TTC CTG GAC ACG CTT 354 
Thr Arg Pro Val His Asp Ala Ala Arg Glu Gly Phe Leu Asp Thr Leu 
IS 75 80 85 

GTC GTG CTG CAC CGG GCA GGG GCG CGG TTG GAT GTG TGT GAC GCC TGG . . 402 
Val Val Leu His Arg Ala Gly Ala Arg Leu Asp Val Cys Asp Ala Trp 
90 95 100 

20 

GGC CGC CTG CCG GTA GAC TTG GCT GAA GAG CAG GGC CAC CGT GAC ATT 450 
Gly Arg Leu Pro Val Asp Leu Ala Glu Glu Gin Gly His Arg Asp He 
105 110 115 120 

25 GCG AGG TAT CTG CAC GCT GCC ACT GGA GAT TGACTGCGGG TTCCCTCCGC 500 
Ala Arg Tyr Leu His Ala Ala Thr Gly Asp 
125 130 



30 



35 



40 



CTTCCGCAAG GACTTCTTTC TCCCCAGCCC CATCTAGGAA GACTGTAAGC ACGAAGAGGC 560 
CACCAGCGCC CAGCCTGCAG 580 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CH/^CTERISTICS : 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

45 Met Leu Gly Gly Ser Ser Asp Ala Gly lieu Ala Thr Ala Ala Ala Arg 
15 10 15 

Gly Gin Val Glu Thr Val Arg Gin Leu Leu Glu Ala Gly Ala Asp Pro 
20 25 30 

Asn Ala Leu Asn Arg Phe Gly Arg Arg Pro lie Gin Val Met Met Met 
35 40 45 



50 



Gly Ser Ala Arg Val Ala Glu Leu Leu Leu Leu His Gly Ala Glu Pro 
55 50 55 60 



Asn Cys Ala Asp Pro Ala Thr Leu Thr Arg Pro Val His Asp Ala Ala 
65 70 75 80 
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Arg Glu Gly Phe Leu Asp Thr Leu Val Val Leu His Arg Ala Gly Ala 
85 90 95 

Arg Leu Asp Val Cys Asp Ala Trp Gly Arg Leu Pro Val Asp Leu Ala 
100 105 110 

Glu Glu Gin Gly His Arg Asp lie Ala Arg Tyr Leu His Ala Ala Thr 
115 120 125 

Gly Asp 
130 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 



(xi) SEQX3ENCE DESCRIPTION: SEQ ID NO: 9: 

Met Ala Thr Ser Arg Tyr Glu Pro Val Ala Glu lie Gly Val Gly Ala 
15 10 15 

Tyr Gly Thr Val Tyr Xaa Ala Xaa Asp Pro His Ser Gly His Phe Val 
20 25 30 

Ala Leu Lys Ser Val Arg Val Pro Asn Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Leu Pro lie Ser Thr Val Arg Glu Val Ala Leu Leu Arg Arg Leu Glu 
50 55 60 

Ala Phe Glu His Pro Asn Val Val Arg Leu Met Asp Val Cys Ala Thr 
65 70 75 80 

Ser Arg Thr Asp Arg Glu He Lys Val Thr Leu Val Phe Glu Xaa Val 
85 90 95 

Xaa Gin Asp Leu Arg Thr Tyr Leu Asp Lys Ala Pro Pro Pro Gly Ijeu 
100 105 110 

Pro Ala Glu Thr He Lys Asp Leu Met Arg Gin Phe Leu Arg Gly Leu 
115 120 125 

Asp Phe Leu His Ala Asn Cys He Val His Arg Asp Leu Lys Pro Glu 
130 135 140 

Asn He Leu Val Thr Ser Gly Gly Thr Val Lys Leu Ala Asp Phe Gly 
145 150 155 160 
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Leu Ala Arg He Tyr Ser Tyr Gin Met Ala Leu Thr Pro Val Val Val 
165 170 175 

Thr Leu Trp Tyr Arg Ala Pro Glu Val Leu Leu Gin Ser Thr Tyr Ala 
180 165 190 

Thr Pro Val Asp Met Trp Ser Val Gly Cys He Phe Ala Glu Met Phe 
195 200 205 

Arg Arg Lys Pro Leu Phe Cys Gly Asn Ser Glu Ala Asp Gin Leu Gly 
210 215 220 

Lys He Phe Asp Leu He Gly Leu Pro Pro Glu Asp Asp Trp Pro Arg 
225 230 235 240 

Asp Val Ser Leu Pro Arg Gly Ala Phe Pro Pro Arg Gly Pro Arg Pro 
245 250 255 

Val Gin Ser Val Val Pro Glu Met Glu Glu Ser Gly Ala Gin Leu Leu 
260 265 270 

Leu Glu Met Leu Thr Phe Asn Pro His Lys Arg He Ser Ala Phe Arg 
275 280 285 

Ala Leu Gin His Ser Tyr Leu His Lys Asp Glu Gly Asn Pro Glu 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Glu Lys Asp Gly lieu Cys Arg Ala Asp Gin Gin Tyr Glu Cys Val 
15 10 15 

Ala Glu He Gly Glu Gly Ala Tyr Gly Lys Val Phe Xaa Ala Xaa Asp 
20 25 30 

Leu Lys Asn Gly Gly Arg Phe Val Ala Leu Lys Arg Val Arg Val Gin 
35 40 45 

Thr Gly Glu Glu Gly Met Pro Leu Ser Thr He Arg Glu Val Ala Val 
50 55 60 

Leu Arg His Leu Glu Thr Phe Glu His Pro Asn Val Val Arg Leu Phe 
65 70 75 80 
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Aap Val Cys Thr Val Ser Arg Thr Asp Arg Glu Thr Lys Leu Thr Leu 
85 90 95 

Val Phe Glu Xaa Val Xaa 61n Asp Leu Thr Thr Tyr Leu Asp Lys Val 
100 105 110 

Pro Glu Pro Gly Val Pro Thr Glu Thr lie Lys Asp Met Met Phe Gin 
115 120 125 

Leu Leu Arg Gly I^eu Asp Phe Leu His Ser His Arg Val Val His Arg 
130 135 140 

Asp Leu Lys Pro Gin Asn lie Leu Val Thr Ser Ser Gly Gin lie Lys 
145 150 155 160 

Leu Ala Asp Phe Gly Ijeu Ala Arg He Tyr Ser Phe Gin Met Ala Leu 
165 170 175.. 

Thr Ser Val Val Val Thr Leu Trp Tyr Arg Ala Pro Glu Val Leu tieu 
180 185 190 

Gin Ser Ser Tyr Ala Thr Pro Val Asp Leu Trp Ser Val Gly Cys He 
195 200 205 

Phe Ala Glu Met Phe Arg Arg Lys Pro Leu Phe Arg Gly Ser Ser Asp 
210 215 220 

Val Asp Gin Leu Gly Lys He Leu Asp Val He Gly Leu Pro Gly Glu 
225 230 235 240 

Glu Asp Trp Pro Arg Asp Val Ala Leu Pro Arg Gin Ala Phe His Ser 
245 250 255 

Lys Ser Ala Gin Pro He Glu Lys Phe Val Thr Asp He Asp Glu Leu 
260 265 270 

Gly Lys Asp Leu Leu Leu Lys Cys Leu Thr Phe Asn Pro Ala Lys Arg 
275 280 285 

He Ser Ala Tyr Ser Ala Leu Ser His Pro Tyr Phe Gin Asp Leu Glu 
290 295 300 

Arg Cys Lys Glu Asn Leu Asp Ser His Leu Pro Pro Ser Gin Asn Thr 
305 310 315 320 



Ser Glu Leu Asn Thr Ala 
325 
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1. An isolated or recombinant cell-cycle regulatory polypeptide comprising an amino 
acid sequence characterized by ankyrin-like repeats, which polypeptide binds to a 
cyclin-dependent kinase (CDK), with the proviso that said polypeptide is not identical 

S in amino acid sequence to SEQ ID No. 2. 

2. The polypeptide of claim 1, wherein said CDK is a Gl phase CDK. 

3. The polypeptide of claim 1» wherein said CDK is CDK4 or CDK6. 

10 

4. The polypeptide of claim 1, which polypeptide has an apparent molecular weight in 
the range of 12kD to 20kD by SDS-PAGE, or is a CDK-bmding fragnient of a full 
length protein having a molecular weight in the range of 12kD to 20kD by SDS- 
PAGE. 

15 

5. The polypeptide of claim 1, which polypeptide competitively displaces the protein 
designated by SEQ ID No. 2 from binding to a CDK. 

6. A substantially pure prq>aration of a cell-cycle regulatory (CCR) protein, or a 
20 fragment thereof, which CCR-protein specifically binds to a cyclin-dependent kinase 

(CDK), the fiill-lengtfa form of said CCR-protein having an approximate molecular 
weight in the range of 12kD to 20kD, with the proviso that said CCR-protein is not 
identical in sequence to SEQ ID No. 2. 

25 7. The CCR-protein of claim 6, which protein comprises an amino acid sequence at least 
60% homologous to an amino acid sequence represented in SEQ ID No. 2. 

8. The CCR-protein of claim 6, which protein comprises an amino acid sequence at least 
60% homologous to an amino acid sequence represented in SEQ ID No. 4. 

30 

9. The CCR-protein of claim 6, which protein comprises an amino acid sequence at least 
60% homologous to an amino acid sequence represented in SEQ ID No. 8. 

10. The CCR-protein of claim 6, which protem comprises an amino acid sequence 
35 represented by the general formula: 

Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa- 
Xaa-Gly-Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa- 
Xaa-Arg-Pro-Val-His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp- 
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Thr-Leu-Val-Val-Leu-His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val- 

Arg-Asp-Ala-Trp-Gly-Arg-Leu-Pro-Xaa-Asp-I»eu-Ala-Xaa-Glu- 

Xaa-Gly-His-Xaa-Asp-Xaa-Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa- 
Gly. 

5 



11. The CCR-protein of claim 6, which protein functions in one of either role of an 
agonist of cell-cycle regulation or an antagonist of cell-cycle regulation. 

12. A substantially pure preparation of a pl6 homolog polypeptide, or a fragment thereof, 
10 comprising an amino acid sequence at least 60% homologous to SEQ ID No. 2, with 

the proviso that said polypeptide is not identical in sequence to SEQ ID No. 2. 

13. The polypeptide of claim 12, including an amino acid sequence repiesented by the 
general formula: 

15 Met-Asp-Pro-Ala-Ala-Gly-Ser-Ser-Met-Glu-Pro-Ser-Ala-Asp- 



Trp-Leu-Ala-Thr-Ala-Ala-Ala-Arg-Gly-Arg-Val-Glu-Glu-Val- 
Arg-Ala-Leu-Leu-Glu-Ala-Val-Ala-Leu-Pro-Asn-Ala-Pro-Aen- 
Ser-Tyr-Gly-Arg-Arg-Pro-Ile-Gln-Val-Met-Met-Met-Gly-Xaa- 
Xaa-Xaa-Val-Ala-Xaa-Leu~Leu-Leu«-Xaa-Xaa-Gly-Ala-Xaa-Xaa- 

20 Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa-Xaa-Arg-Pro-Val-His- 
Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-Leu-Val-Val-Leu- 
His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp-Gly- 
Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp- 
Xaa-Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa-Gly-Gly-Thr-Arg-Gly- 

25 Ser-Asn-His-Ala-Arg-Ile-Asp-Ala-Ala-Glu-Gly-Pro-Ser-Asp- 
Ile-Pro-Asp. 



14. The polypeptide of claim 12, wherein said polypeptide which functions in one of 
either role of an agonist of cell cycle regulation or an antagonist of cell cycle 

30 regulation. 

15. A substantially pure preparation of a pi 5 polypeptide, or a fragment thereof, 
comprising an amino acid sequence at least 60% homologous to SEQ ID No. 4, with 
the proviso that said polypeptide is not identical in sequence to SEQ ID No. 2. 

35 

16. The polypeptide of claim 15, including an amino acid sequence represented by the 
general formula: 

Met-Arg-Glu-Glu-Asn-Lys-Gly-Met-Pro-Ser-Gly-Gly-Gly-Ser- 
40 Asp-Glu-Gly-Leu-Ala-Thr-Pro-Ala-Arg-Gly-Leu-Val-Glu-Lys- 
Val-Arg-His-Ser-Trp-Glu-Ala-Gly-Ala-Asp-Pro-Asn-Gly-Val- 
Asn-Arg-Phe-Gly-Arg-Arg-Ala-Ile-Gln-Val-Met-Met-Met-Gly- 
Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa-Xaa-Gly-Ala-Xaa- 
Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa-Xaa-Arg-Pro-Val- 
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His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-Leu-Val-Val- 
Leu-His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp- 
Gly-Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa- 
Asp-Xaa-Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa-Gly-Asp. 

17. The polypeptide of claim 15, wherein said polypeptide which functions in one of 
either role of an agonist of cell cycle regulation or an antagonist of cell cycle 
regulation. 



10 18. A isolated or recombinant polypeptide comprising an amino acid sequence 
homologous to an amino acid represented in one or more of SEQ ID Nos. 2, 4 or 8, 
which polypeptide specifically interacts with cyclin-dependent kinase 4 (CDK4), 
which specific interaction is affected by residues K22, R24, H95 and D97 of CDK4, 
with the proviso that said CCR-protem is not identical in sequence to SEQ ED No. 2. 

15 

19. The polypeptide of claim 1 8, which polypeptide inhibits phosphoryl transfer reactions 
mediated by CDK4, 

20. An antibody preparation specifically reactive with an epitope of the CCR-protein of 
20 claim 6. 



21. An antibody preparation specifically reactive with an epitope of the polypeptide of 
SEQ ID No. 4. 

25 22. An antibody preparation specifically reactive with an epitope of the polypeptide of 
SEQ ID No. 8. 

23. A recombinant polypeptide having an amino acid sequence at least 60% homologous 
to an amino acid sequence represented in at least one of SEQ ID Nos. 2, 4 or 8, with 

30 the proviso that said CCR-proteui is not identical in sequence to SEQ ID No. 2. 

24. The polypeptide of claim 23, vAdch polypeptide functions in one of either role of an 
agonist of cell cycle regulation or an antagonist of cell cycle regulation. 

35 25. The polypeptide of claim 23, which polypeptide binds to a cyclin dependent kinase. 

26. The polypeptide of claim 25, wherein the cyclin-dependent kinase is CDK4, and 
binding of the polypeptide to CDK4 is affected by residues K22, R24, H95 and D97 
ofCDK4. 



40 



wo 95/28483 PCTAJS95/04636 

-89- 

27. The polypeptide of claim 26, which polypeptide inhibits phosphoiyl transfer reactions 
mediated by CDK4. 

28. The polypeptide of claim 23, comprising an amino acid sequence represented by the 
5 general formula: 

Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa- 
Xaa-Gly-Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa- 
Xaa-Arg-Pro-Val-His-Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp- 
10 Thr-Leu-Val-Val-Leu-His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val- 
Arg-Asp-Ala-Trp-Gly-Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu- 
Xaa-Gly-His-Xaa-Asp-Xaa-Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa- 
Gly. 

IS 29. The polypeptide of claim 23, compr\sing an amino acid sequence represented by the 
general formula: 

Met-Asp-Pro-Ala-Ala-Gly-Ser-Ser-Met-Glu-Pro-Ser-Ala-Asp- 
Trp-Leu-Ala-Thr-Ala-Ala-Ala-Arg-Gly~Arg-Val-Glu-Glu-Val- 
Arg-Ala-Leu-Leu-Glu- Ala- Val -Ala-Leu- Pro- Asn-Ala-Pro-Asn- 

20 Ser-Tyr-Gly- Arg- Arg- Pro- He -Gin- Val -Met -Met -Met - Gly-Xaa - 

Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa-Xaa-Gly-Ala-Xaa-Xaa- 
Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa-Xaa-Arg-Pro-Val-His- 
Asp-Ala-Ala-Arg-Glu-Gly-Phe-Leu-Asp-Thr-lieu-Val-Val-Leu- 
His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp-Gly- 

25 Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp- 
Xaa-Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa-Gly-Gly-Thr-Arg-Gly- 
Ser-Asn-His-Ala-Arg-Ile-Asp-Ala-Ala-Glu-Gly-Pro-Ser-Asp- 
Ile-Pro-Asp. 

30 30. The polypeptide of claim 23, comprising an amino acid sequence represented by the 
general formula: 

Met-Arg-Glu-Glu-Asn-Lys-Gly-Met-Pro-Ser-Gly-Gly-Gly-Ser- 
Asp-Glu-Gly-Leu-Ala-Thr-Pro-Ala-Arg-Gly-Leu-Val-Glu-Lys- 

35 Val-Arg-HiB-Ser-Trp-Glu-Ala-Gly-Ala-Asp-Pro-Asn-Gly-Val- 
Asn-Arg-Phe-Gly-Arg-Arg-Ala-Ile-Gln-Val-Met-Met-Met-Gly- 
Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu-Xaa-Xaa-Gly-Ala-Xaa- 
Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr-Xaa-Xaa-Xaa-Arg-Pro-Val- 
His-Asp- Ala-Ala-Arg-Glu-Gly- Phe-Leu- Asp-Thr-Leu- Val - Val - 

40 Leu-His-Xaa-Xaa-Gly-Ala-Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp- 
Gly-Arg-Leu-Pro-Xaa-Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa- 
Asp-Xaa-Xaa-Xaa-Tyr-Leu-Arg-Xaa-Ala-Xaa-Gly-Asp. 
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31. The polypeptide of claim 23, which polypeptide is a fusion protein further 
comprising, a second polypeptide portion having an amino acid sequence from a 
protein unrelated the protein of SEQ ID No. 2, 4 or 8. 

S 32. The polypeptide of claim 3 1 , wherein said fusion protein is functional in a two-hybrid 
assay. 

33. A substantially pure nucleic acid having a nucleotide sequence which encodes a cell- 
cycle regulatory polypeptide comprising an amino acid sequence characterized by 

10 ankyrin-like repeats, which polypeptide binds to a cyclin-dependent kinase (CDK), 

with the proviso that said polypeptide is not identical in amino acid sequence to SEQ 
ID No. 2. 

34. The nucleic acid of claim 33, wherein said CDK is a Gl phase CDK. 

15 

35. The nucleic acidof claim 33, wherein said CDK is CDK4 or CDK6. 

36. The nucleic acidof claim 33, which polypeptide has an apparent molecular weight in 
the range of 12kD to 20kD by SDS-PAGE, or is a CDK-binding fragment of a full 

20 length protein having a molecular weight in the range of 12kD to 20kD by SDS- 

PAGE. 

37. A substantially pure nucleic acid having a nucleotide sequence which encodes a ceU 
cycle regulatory (CCR) protein, or a fragment thereof, which specifically binds a 

25 cyclin-dependent kinase (CDK), the full-length form of said CCR-protein having an 

approximate molecular weight in the range of 12kD to 20kD, with the proviso that 
said polypeptide is not identical in amino acid sequence to SEQ ID No. 2. 

38. The nucleic acid of claim 37, wherein said CCR-protein encoded by said nucleotide 
30 sequence has an ammo acid sequence at least 60% homologous to an amino acid 

sequence represented in SEQ ID No. 2. 



35 



39. 



The nucleic acid of claim 37, wherein said CCR-protein encoded by said nucleotide 
sequence has an amino acid sequence at least 60% homologous to an amino acid 
sequence represented in SEQ ID No. 4. 
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40. The nucleic add of claim 37, wherein said CCR-protein encoded by said nucleotide 
sequence has an amino acid sequence at least 60% homologous to an amino acid 
sequence Tq>resented in SEQ ID No. 8. 

5 41. The nucleic acid of claim 37, wherein said CCR-protein encoded by said nucleotide 
sequence has an amino acid sequence represented by the general formula: 

Met-Met-Met-Gly-Xaa-Xaa-Xaa-Val-Ala-Xaa-Leu-Leu-Leu- 
Xaa-Xaa-Gly-Ala-Xaa-Xaa-Asn-Cys-Xaa-Asp-Pro-Xaa-Thr- 
Xaa-Xaa-Xaa-Arg-Pro-Val-His-Asp-Ala-Ala-Arg-Glu-Gly- 
10 Phe-Leu-Asp-Thr-Leu-Val-Val-Leu-His-Xaa-Xaa-Gly-Ala- 
Arg-Leu-Asp-Val-Arg-Asp-Ala-Trp-Gly-Arg-Leu-Pro-Xaa- 
Asp-Leu-Ala-Xaa-Glu-Xaa-Gly-His-Xaa-Asp-Xaa-Xaa-Xaa- 
Tyr-Leu- Arg-Xaa-Ala-Xaa-Gly . 

15 42. The nucleic add of claim 37, wherein said CCR-protein encoded by said nucleotide 
sequence functions in one of dther role of an agonist of cell cycle regulation or an 
antagonist of cell cycle regulation. 

43. The nucleic acid of claim 37, wherein said nucleotide sequence hybridizes under 
20 stringent conditions to a nucleic acid probe corresponding to at least 12 consecutive 

nucleotides of one or more of SEQ ID No. 1, SEQ ID No. 3 or SEQ ID No. 7. 

44. A probe/primer comprising a substantially purified oligonucleotide, said 
oligonucleotide containing a region of nucleotide sequence which hybridizes under 

25 stringent conditions to at least 10 consecutive nucleotides of sense or antisense 

sequence of SEQ ID No. 3 or 7, or naturally occuring mutants thereof. 

45. The probe/primer of daim 44, further comprising a label group attached thereto 
and able to be detected. 

30 

46. The probe/primer of claim 45, wherein said label group being selected from a group 
consisting of radioisotopes, fluorescent compounds, enzymes, and enzyme co-factors. 

47. A diagnostic test kit for identifying an transformed cells, comprising the piobe/primerof 
35 claim 44, for measurmg, in a sample of cells isolated from a patient, a level of a nucleic 

acid encoding a cell-cycle regulatory protein. 



40 



48. An assay for screenmg test compounds for an inhibitor of an interaction of a cell- 
cycle regulatory (CCR) polypeptide with a cyclin dependent kinase (CDK) 
comprising 
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i. combining a CDK, and a CCR polypeptide comprising an amino acid. 

sequence characterized by ankyrin-Iike repeats, which polypeptide binds to 

said CDK, under conditions wherein said CDK and said CCR polypeptide are 

able to interact; 
. ii. contacting said combination with a test compoimd; and 
iii. detecting the formation of a complex comprising said CDK and said CCR 

polypeptide, 

wherein a decrease in the formation of said complex in the presence of said test 
compoimd is indicative of an inhibitor of the interaction between a CDK and a CCR 
polypeptide. 

A method of identifying an agent v4iich disrupts the ability of a CCR protein to 
regulate a eukaryotic cell cycle, comprising: 

i. providing a two-hybrid assay system including (a) a first fusion protein 
comprising a cyclin-dependent kinase (CDK), and (b) a second fusion 
protein comprising a CCR polypeptide polypeptide comprising an amino 
acid sequence characterized by ankyrin-like repeats, which polypeptide 
binds to said CDK, imder conditions wherein said two hybrid assay is 
sensitive to intemctions between the cdk of said first fusion protein and said 
pl5lNK4 polypeptide of said second polypeptide; 

ii. contacting said two-hybrid assay system with a candidate agent; 

iii. measuring a level of interactions between said fusion proteins in the 
presence of said candidate agent; and 

iv. comparing the level of interaction of said fusion proteins in the presence of 
said candidate agent to a level of interaction of said fusion proteins in tiie 
absence of the candidate agent, 

wherein a decrease in the level of interaction in the presence of said candidate agent is 
indicative of inhibition of an interaction of a CCR protein with a cyclin dependent 
kinase. 

A method of determining if a subject is at risk for a disoider characterized by 
imwanted cell proliferation, comprising detecting, in a tissue of said subject, the 
presence or absence of a genetic lesion characterized by at least one of 

a mutation of a gene encoding a CCR protein, and the mi s-e)qpression of said 
gene, wherein said CCR protein has an ammo acid sequence characterized by ankyrin- 
like repeats, and which polypeptide binds to a cyclin-dependent kinase (CDK) 
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51. The method of daixn 50, wherein deteaing said genetic lesion comprises 
ascertaining the existence of at least one of 

i. a deletion of one or more nucleotides from said gene, 

ii. an addition of one or more nucleotides to said gene, 

5 iii. an substitution of one or more nucleotides of said gene, 

iv. a gross chromosomal rearrangement of said gene. 
V. a gross alteration in the level of a messanger RNA transcript of said 
gene, 

vi. the presence of a non-wild type splicing pattern of a messanger RNA 
1 0 transcript of said gene, and 

vii. a non-wild t3rpe level of said protein. 

52. The method of daim 50, wherein detecting said genetic lesion comprises 

i. providing a probe/primer comprising an oligonucleotide containing a 
15 region of nucleotide sequence which hybridizes to a sense or antisense 

sequence of SEQ ID No, 1 or 3, or naturally occuring mutants thereof, or 5' 
or 3* flanking or intronic sequences naturally associated with said gene; 

ii. exposing said probe/primer to nucleic acid of said tissue; and 

iii. deteaing, by hybridization of said probe/primer to said nucleic acid, the 
20 presence or absence of said genetic lesion. 

53. The method of claim 50, wherein detecting said lesion comprises utilizing said 
probe/primer to determine the nucleotide sequence of said gene and, optionally, of 
said flanking nucleic acid sequences. 

25 

54. The method of claim 50, wherein detecting said lesion comprises utilizing said 
probe/primer to in a polymerase chain reaction (PGR) or a ligation chain reaction 
(LCR). 

30 55. The method of claim 51, wherein the level of said protein is detected in an 
immimoassay. 

56. A method of determining a risk of cellular transformation of a cell, comprising 
detecting m the cell the presence of a protein complex between a cyclin dependent 
35 kinase (CDK) and a polypeptide comprising an ammo acid sequence characterized by 

ankyrin-like repeats, which polypeptide binds to said CDK, with the proviso that said 
polypeptide is not identical in sequence to SEQ ID No. 2. 
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EJC12 NIE16:2 

CGGAGAGGGAATTCGGCACGAGGCAGCATGGAGCCTTCGGCTGACT 

S ^ Bd 

GGCTGGCCACGGCCGCGGCCCGGGGTCGGGTAGAGGAGGTGCGGGC 

Z Ex13 



GCTGCTGGAGGCGGTGGCGCTGCCCAACGCACCGAATAGTTACGGT 

NTp1g.3 \ Ex14 
CGGAGGCCGATC CAGGTCATGGATGATGGGCAGCGCCCCGAGTGGC 
^ Ex2 ^ 

GGAGCTGCTGCTGCTCCACGGCGCGGAGCCCAACTGCGCCGACCCC 



^ P16INT 

GCCACTCTCACCCGACC CGTGCACCACGCTGCC CGGGAGGGCTTCT 

NTp16.5 >v ^ 



GGACACGCTGGTGGTGCTGCACCGGGCCGGGGCGCQGCTGGACGTG ^ 

Z E2^2 o 

CGCGATGCCTGGGGCCGTCTGCCCGTGGAC^ n 

2 

GCCATCGCGATGTCGC ACGGTACC TGCGCGCCCGTGCGGGGGGCAC 

^ gx15 



CAGAGGCAGTAACCATGCCCGCATAGATGCCGCGGAAGGTCCCTCA 

^ 

GACATCCCCGATTGAAAGAAC CAGAGAGGCTCTGAGAAACCTCGGG 

^ Ex5 ^ 

AAACTTAGATCATCAGTCACCGAAGGTCCTACAGGGCCACAACTGC 

CCCCGCCACAACCCACCCCGCTTTCGTAGTTTTCATTTAGAAAATA 

GAGCTTTTAAAAATGTCCTGCCTTTTAACGTAGATATAAGCCTTCC 

CCCACTACCGTAAATGTCCATTTATATCATTTTTTATATATTCTTA 

TAAAAATGTAAAT^GAAAACACCGCTTCTGCCTTTTCACTGTGTT 



Figure I A 
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GGAGTTTTCTGGAGTGAGCACTCACGCCCTAAGCGCACATTCATGT 

GGGCATTTCTTGCGAGCCTCGCAGCCTCCGGAAGCTGTCGACTTCA 

^ ExZ 

TGACAAGCATTTTGTGAACTAGGGAAGCTCAGGGGGGTTACTGGCT 

TCTCTTGAGTCACACTGCTAGCAAATGGCAGAACCAAAGCTCAAAT 

AAAAATAAAATTATTTTCATTCATTCACTCAAAAAAA 



Figure IB 
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pl6exi 
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pl6exl 
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pl6exl 
pl6exl3 



pl6exl 
pl6exl3 



pl6NT2 
pl6exl 
pl6exl3 



pl6nt3 
pl6nt2 
pl6exl3 



pl6nt3 
pl6nt2 
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< GGNGGNAAGNTGTGGGGGAAAGTTTGGGGATGGAANACCAANCCCTCCTTTC^ 
+ + + + + + 

< AOICTGGCTCrGNCGAGGCTNCOTCCGANTGGTNCCCCCGGGGGAGACCC^ 

< GACTTCAGGGNTGCWACATTCACrAAGTGCTNGGAGNTAATANCACCrCCT 

TCa^CTTATTGOTAGQANATAATAACACCTCCACCGATAACT 

+ + + + + + 

< TCGCTCACyiGCGTCCCCTTACCTNGANAGATACCNQSXQXTCCCTCCA 

< TOlCnrrACAACGTCCCOTnTCCTGGAAAGATACAC^ 

+ + + + + + 

< GAaVGGNTCGGAGGGGGCTCTTCCCCCANCACCGGAGGAAGAAAGAGG^ 

< GACAGGGTNGGAGNGGTCTCTTCOTCCACCACCGGAGGAAGAAAGAGGAGGGGCTGNCT 

+ + + + + + 

BxlA--(12)— > 

< GTCACCAGAGGGTGGGACGGACCGCGTGCGCTCGGCGNCTNCGGAGAGGGGGAGAACAGA 

< TTCACCAGAGGGTGGGACGGACCNCGTACGCTCGNCGNCTNCGGAGAGGGGGAGAGGAGT 

+ + + + + + 

< a^CGGGCGGCGGGGAGCAGCATGGATCCGGCGGCGGGGAGCAGCTVTGGANCCITCGA 

< a^CGCarCGNCGGGGAGCAAOlTGGAACCGNCGGOGGGGAGCAG^ 

+ + + + + + 

< GACNNNCTCCGGCCGGNGTCGGGTAGAGGAGGTGCGGGCGCTGCTGGAG 

< GACTGACTGCCTCGC 

< GACTGGCTGNCCACGNCCACGNCCCGGGGTCGGGTAGAGGAGGTGCGGNCGCTNC^ 
+ + + + + + 

< Bxl3 

> CTCTNANCCCGGGTA 

< GCGGGGGCXSCTGCCCAACGCTlCCGAATAGTTACGGTCGGAGGCaSAT^ . 

< GCGGGGNCTCTGNCCAACNCGCTAAT^ 

+ + + + + + 

> GAGGGTCTGCAGCGGGAGCAGNGGATGGCGGGCGACTCTGGAGGACGAAGTTGGCAGGGG 

< GAGGGTCTGCAGCGGGAGCTIGGGGATGGCGGGCGACTCTGGAGGACGAAGTTTGCAGGGG 

+ + + + + + 

< ExlB 

> AATTGGAATCAGGTAGCGCTTCGAlsrrCTCCGGAAAAAGGGGAGGCTTCCTGGGGAGTTN^ 

< AATTGGAATCAGGTAGCGCTTCGATTCTCCNGAAAAAGGGGAGGCTTCCTGGGGAGTTTT 



Figure 2 A 
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pl6nt3 > CAGAAGGGGTTTGTAATCACAGNCCTCCNCCn'GGaSACGCCCTGGGGGG 
pl6nt2 < CaiGAAGGGGTTTGTAATCAa^GACCTCCrCCTGGCXSACGTCCTG^ 



pl6nt3 
pl6nt2 



> AGGAAGAGGAATGAGGAGNCACGCGCmACAGNTCTCTCGAATNCrGANAAGATCT 

< AGGAAGAGGAATNAGGAGCCACGCGCGTACAGATCTCTCGAATGCTGAGAAGATCr^^ 



pl6nt3 
pl6nt2 



> GGGGGAACATATTTGTATTAGxATNNAAGTATGCTCnTTATCAGATACi^AAATTCACG^ 
< GGGGGAACATATTTGTATTAGCOTCCAAGTOTWCTCTNTATCANATACa^AAOT 



pl6nt3 
pl6nt3 



> CGTGTGGNATAAAAAGGGAGTCTTAAAGAAATNTAAGATGTGCTGGGACTACTTAGCCTC 

> CAANACACAGATNCCTGGATGGAGCT 



Figure 2B 
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pieint > AAAAmAAAAAAAATCTCCCAGGCCTAACATAATTOTCAGGAAAGAAATTTCAGTAGTTG 

+ + + + + + 

pieint > NATCTCAGGGGAAATACAGGAAGTTAGCCTGGAGTAAAAGTCAGTCTGTCCCrrGCCCC^ 

. + '¥ + + + + 

pieint > TGCTANATTGCCCGTGCCTCACAGTGCTCTCTGCCTGTGACGACAGCTCCNCAGAAGTTC 
+ + + + + + 

pl6int > GGAGGATATAATGGAATTCATTGTGTACTGAAGAATGGATAGAGAACTCAAGAAGGAAAT 

+ + + + + + 

pl6int > TGGAAACTGGAAGCAAATGTAGGGGTAATTAGACACCTGGGGCTTGTGTGGGGGTCTGCT 
pl6exl5 < AANAAAAAAGAAATNGATAANATAGAGGAT 



EX2A ---> 

pl6 int > TGGCGGTGAGGGGGCTCTAaVCAAGCTTCCTTTCCGTCATGCCGNCCCCCACCCTGGCTC 
pl6exl5 < GAACANATTAAAATCAAAAAACANAACANAGACATAAT 



EX14 

pieint > TGACCATTCTGTTCrCTCTGGOlGGTCATGATGATGGGCAGCGCCaSAGTGGCC^ 
pl6exl5 < NTAATCATAATTATAAAGGTCAAGACTCATTGATATllAAGGA 



-> 

pieint > CTGCTGCTCCACGGCGCGGAGCCCAACTGCTCCGACGCCG 

pl6ex2 > CCTGCNACGACCCCGCCACTCTCACCCGACCCGTG 
pieexl4 > NCTCTCACGGTGGGGAGGCCAACTGCGCCGAACCCGCCACTCTCACCCGACCCGCG 
pieexlS < ACTAGCACAAlTOGNATNAAAAAANAATTCCCyVCGAaiCCGCCACTCTC^ 



pieex2 > CACGACGCTGTCCGGGAGGGTTTCCTGGACACGCTGGTGGTGCTGCACCGGGCCGGGGNG 
pieexl4 > CACGACGGTGCCCGGGAGGGGTTCCTGGACACGCTGGTGGTGCTGCACCGGGCCGGGGCG 
pl6exl5 < CTCGACACTGCCCGGGAGGTCNTCCTGGACTVCGCTGGTGGTNCTCCACCGGNCC^^ 



pl6ex2 > OSGTTGGACGTGCGCGATGCCTGGGGCCGCCTNCCCGTGGXACCTGGTTGAGGAGCTGGG 
pl6exl4 > CGGCTGGACGTTCGNGATGCCTGGGGGCOTCTNTCCGTNGXACCTGGCTGAAGAGCTGGN 
pieexlS < OSTCTGGACGTGCGCGATGCCTGGGNCCGNCTACCCGTGGTACCTGACTGAGGACCTGGG 



pieex2 > NCATCGCGATGTCGCACGGTACCTGCGCGCGGTTGCGGGGGGCACCAGAGGXNAGTNACC 
pieexl4 > NCATCGNGATGTCGCACGGCOJCTGTGTGNGGNTGCGGGGGGCACCATAGGTCAGTOT 
pl6exl5 < CCATCCCGATTTCGCNGGGTANCTGNGNGNGGCTGNGGGGGCaU^AGAGGXC^^ 



Figure 2C 
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P16EX5 < XAAGTATGAGCGAAACNAATTGTGGTTTGAGAANAGGNAATCGTAGGGAACTTCGGGATC 
+ + + + + + 

P16EX5 < CCNCNGGGANCNCCAGAACCTGAGNCGCCmTTGGAAATNAa^ 

+ + + + + + 

P16EX5 < CGNACCAGOTNCaAAAGATACCrGGGGANGCGGGAAGGGAAAGAOTAC^ 

P16EX9 < CCCC 

+ + + + + + 

P16EX5 < TTCGGNCCTXGGNATTGTGAGCAGCCTCTGAGACTCATTXATATNAOV 
P16EX9 < ATCGCGCCTTGGGAOTGTGAGOIACCT^TTGAGACTCATNAATATAGC^ 

. + + + + + + 

P16EX5 < CITAa^CCCTGCGGNCCGCGCGGTCGCGCTITCT 

P16EX9 < CTTGOVACCCTGCGGNCCGCGCGGTCGCGCnTITCTCTGCCCrCCGC^ 

+ + + + + 

P16EX5 < GCGCTTGAGCGGTCGGCGCGCCTGGAGCAGCCAGGOSGNCAGTGGACTAGCT 
P16EX9 < GCGCTTGAGCGGTCGGCGCNCCTGGANCAGCCAGGaSGGCAGTGGACT 

+ + + + + + 

P16EX5 < AGGGAGGTGTGGGAGAGCGGTGGCGGCGGGTACATGCACGTGAAGCCT^TTGCGAGAAC^ 
P16EX9 < AGGGAGGTGTGGGAGAGCGGTGNCGGCGGGTACATGCACGTGAAGCCATTGCGAGAACTT 

+ + + + + + 

P16EX5 < TATCCATAAGTATTTCAATACCGGTAGGGACGGCAAGAGAGGAGGGCGGGATGTGCCAC^ 
P16EX5 < TATCC:ATAAGTATTTCAATGCCGGTAGGGACGGCAAGAGAGGAGG6CGGGATGTNCa\^ 

+ + + + + + 

P16EX5 < CATCTTTGACCTCAGGTTTCTAACGCCTGTTTTCTTTCTGCCCTCTGC^ 

P16EX9 < CATCTTTGACCTCAGGTTTCTAACGCCTGTTTTCTTTCTGCCCTCTGCAGACyVTCCCC^ 

+ + + + + + 

P16EX4 > AGAAATTAGATCATCAGTCACCGATG 
P16EX5 < TTGAAAGAACCAGAGAGGCTCTGAGAAACC 
' P16EX9 < TTGAAAGAACCAGAGAGGCTCn'GAGAAACCTCCGGAAACrTAGXTCT^TCAXTC^ 

+ + + + + + 

P16EX4 > GTCCTACAGGGNCACAACrGNCCCCGCCACAACCCACCCCGlSr^ 
P16EX9 < AA 



Figure 3 A 
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P16EX4 > GAAAATAGAGCTTTTAAAiUVTGTCCTGCCTTTTAACGTAGATATATGCCTTCCCCCACTA 
+ + + + + + 

P16EX4 > CCGNAAATGTCCATTTATATCaVTNTTTTATATATTClTATAAA;^ 

+ + + + + + 

P16EX4 > CACCGCTTCTGCCnTrTCTVCTGTGTTGGAGTTTTCT 

+ + + + + + 

PI 6EX6 > CANC^TATNTNCGGCATTTCTNGNGAGCCTCGTAGTCTCCGGATGNTGTC^ 
P16EX6A > aUJCraVTNTNCGGCATTTCTNG^ 

.P16EX4 > GCaiCATTCATGTGGGCATTTCnTGCGAGCCTaSCAGNCTCCGGAAGCT 

+ + + + + + 

P16EX6 > GGGGGGNCasTGTACCCAATTCGNCCTATNGTGAGTCGTNTTAC^ 

P16EX6A > GGGGGGNCOTGTACCCAATTCGNCCTATNGTGAGTCGTNTTACAATTCACTGGCCGCCGT 

P16EX4 > GGGGGGNCCGGTACCCAATTCGCCCTATAGTGAGTCGTATTACTATTCACTGGNq^^^ 

+ + + + + .+ 

P16EX6 > TTTXACAACGTCGXTGXACTGGGAAAACCCTGGTGTTACCCT^CTTXAATCGCCT^ 
P16EX6A > TTTXACSU^CGTCGXTGXACTGGGAAAACCCTGGTGTTACCCAACTT)^ 
P16EX4 > TTTTACAACGTCGGTGGACTGGGAAAACCCCGGNGTTACCCAACTTTAATC^ 

+ + + + + . # + 

P16EX6 > NACATCCCCCITIXCGCCAGCTGGTGTAATAGCGANGA 

P16EX6A, > NACATCCCCCTTTXCXSCCAGCraGTGTAATAGaSA^ 

P16EX4 > GACATCCCCCTTTTCGCCAGNTGGGGTTATAGNGAAGAGGGCaiCACC^^ 

+ + + + + . + 

P16EX6 > CAACAGTTGNGCAGCCTGAATGGCGAATGGAAATTGTAAGCGTTAATATTTTGTTAAAAT 
P16EX6A > CAACAGTTGNGCAGCCTGAATGGCGAATGGAAATTGTAAGCGTTAATATTTT 

+ + • . + + + + 

P16EX6 > TCGCGTTANATCOTCGGTTAANTCAGCTCATNTTTTATCC^ 
P16EX6A > TCGCGTTANATCNTCGGTTAANTCAGCTCATNTTTTATCaA^ 

+ + 4- -f -I- 

P16EX6 > ATCCCCAATAAATCAANAGAATAGACCGAGATAGGGTTGAGTGTCGTTCCAGTTNGGGAA 
P16EX6A > ATCCCCAATAAATCAANAG2ATAGACCGAGATAGGGTTGAGTGTCGTTCCAGTTNGGGAA 

+ + + + + + 

P16EX6 > CANGAGTCCACTATTAAAGANCGTAGNCTCNAACGTCANAGGGCGAAAAACCNT^^ 
P16EX6A > CANGAGTCCACTATTAAAGANCGTAGNCTCNAACGTa^GGGCGAAAAACCOT'^^ 



Figure SB 
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P16EX6 > GNGGATTGGNCCACTACGCNTANCC 

P16EX6A > GNGGATTGGNCCACTACGCNTANCCATCACCCTATTC 



Figure 3C 
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Figure 5 
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